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PROMOTER 



FIELD OF THE PRESENT INVENTION 

5 The present invention relates to a promoter, including a construct and an expression 
vector comprising the same and a transformed cell comprising the same. In addition 
the present invention relates to a plant cell, as well as a plant, comprising the same. 

BACKGROUND OF THE PRESENT INVENTION 

10 

Expression of plant genes is controlled in a complex pattern during the life cycle of a 
plant. Several processes are involved in this regulation of gene expression. The 
main steps are; the initiation of transcription; the termination of transcription; the 
processing of transcripts; the transport of mRNA to the ribosomes; and the 
1.5 translation. 

Three examples of plant genes that are expressed are found in Huang et af (1996 
Biosci Biotech Biochem 60 (2) 233-239) who report on three rice sucrose synthase 
isogenes, which are called RSusI, RSus2 and RSus3. The authors also report on their 
20 differential regulation of their expression. The authors state that the gene organisation 
patterns of RSus2 and RSus3 were the same. 

One of the major processes controlling gene expression is initiation of transcription. 
The transcription is initiated by binding of RNA polymerase together with several 
25 transcription factors to the promoter region. Specific regulatory DNA sequences (cis- 
elements) in the promoter serve as binding sites for these transcription factors (trans- 
acting factors). 

The cis-elements found in plant gene promoters can be divided in two categories. 

30 The first category comprises those cis-elements which are involved in initiation of 
transcription. The TATA box and the CAAT box are examples of proximal cis- 
elements involved in initiation of transcription. The CAAT box defines the binding site 
for the RNA polymerase, and the TATA box directs the RNA polymerase to the 
correct transcription start site. Presence of multiple CAAT boxes normally indicate a 

35 constitutive promoter. The TATA box and CAAT box are conserved among 
prokaryotes and eukaryotes, but are not essential for the function of some plant gene 
promoters. The second category is composed of cis-elements which are involved in 
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temporal and spatial regulation of gene expression. Genes encoding seed storage 
proteins (Glutamins. Legumins, Prolamins etc.) are examples of genes which are 
temporally and spatially regulated, and are thereby expressed in a tissue-specific and 
developmental manner. Examples of endosperm-specific cis-elements are the AACA 
5 motif and the endosperm box. These cis-elements. that contribute to tissue-specific 
and developmental expression of endosperm storage protein genes, are conserved 
among a wide range of seed storage protein genes. 

The manner in which complex patterns of transcription factors act on specific cis- 
1 0 elements determines whether genes are more or less constitutively expressed, or are 
expressed at specific times during development. Furthermore, these interactions 
determine whether expression occurs in a specific tissue e.g. the endosperm, is 
displayed by several tissues e.g. those of the seed, or is common to all parts of the 
plant . Multiple trans-acting factors can recognise variants of the cis-element 
consensus sec l uences and compete for binding, yielding complex expression 
patterns. Each plant gene promoter has a set of transcription factors and other 
trans-acting factors, which interact with the promoter sequence and the RNA 
polymerase and thereby regulate gene expression in an unique pattern. 



15 



It is known that it is desirable to direct expression of a nucleotide sequence of interest 
("NOI") in certain tissues of an organism - such as a plant. The NOI will typically 
encode a product of interest ("POI"). For example, it may be desirable to produce crop 
protein products with an optimised amino acid composition and so increase the nutritive 
value of the crop. It may even be desirable to use the crop to express non-plant genes 
such as genes for mammalian products. Examples of the latter products include 
interferons, insulin, blood factors and plasminogen activators. 



However, whilst it may be desirable to achieve expression of a NOI in certain tissues it 
is sometimes important (if not necessary) to ensure that the NOI is not expressed in 
30 other tissues in such a manner that detrimental effects may occur. Moreover, it is 
important not to upset the normal metabolism of the organism to such an extent that 
detrimental effects occur. For example, a disturbance in the normal metabolism in a 
plant's leaf or shoot could lead to stunted growth of the plant. 

35 An example of the use of plant promoters to cause expression of an NOI in plant tissue 
may be found in CA-A-2006454. which describes a DNA sequence of an expression 
cassette in which the potato tuber specific regulatory regions are localised. The 
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expression cassette contains a patatin-gene with a patatin-gene promoter. The DNA 
sequence is transferred into a plant genome using Agrobacterium. According to CA-A- 
2006454, the DNA sequence enables heterologous products to be prepared in crops. 

5 However, in plant transformation processes, it is generally the low efficiency of both 
transformation and regeneration that seriously slow vector development, since they 
limit the number of genetic constructs which can be tested. Investigations of the 
strength and tissue specificity of different transcriptional promoters, which can greatly 
influence the effect of the genetic manipulation, can be unmanageably labour-intensive 

10 if performed in stable transformants. This has resulted in a tendency to use strong 
constitutive promoters which are not tissue-specific to. direct transgenic expression in 
stable transformants, e.g. from viruses (cauliflower mosaic virus 35S promoter) and 
Agrobacterium (nopaline synthase promoter (NOS)). Not the least problematic element 
of this approach occurs when lowering of gene expression by antisense transcription is 

15 attempted, because gratuitous suppression of gene expression in all tissues can result 
in developmental retardation of non-target tissues, or other weakening of the 
transformant. Under such circumstances, only those transformants, in which the 
genetic construct poorly suppresses gene expression, would be expected to survive 
selection. There is, therefore, a strong argument for the use of tissge-specific 

20 transcriptional promoters for directing antisense transcription, but, in many crop 
species, characterising such promoters in stable transformants is not feasible owing to 
low efficiency of transformation and regeneration. 

Despite the fact that there are already some promoters available in the art, there is 
25 still a need to have additional promoters, in particular promoters that are efficient 
and/or selective in their ability to allow for the expression of a NOI. 

Thus, the present invention seeks to provide a promoter that is capable of causing the 
expression (transcription) of a NOI. 

More in particular, the present invention seeks to provide a promoter that is capable of 
directing the expression (transcription) of a nucleotide sequence of interest in specific 
tissues, or in just a specific tissue, of an organism, typically a plant. 

35 
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SUMMARY ASPECTS OF THE PRESENT INVENTION 

Aspects of the present invention are presented in the claims and in the following 
commentary. 

5 

In brief, some aspects of the present invention relate to: 

A novel promoter capable of selective expression. 
Novel promoter nucleotide sequences. 
Expression systems comprising said promoters. 
Methods of expression using said promoters. 
Transformed hosts/host cells comprising said promoters. 

As used with reference to the present invention, the terms "expression", "expresses", 
15 "expressed" and "expressable" are synonymous with the respective terms 
"transcription", "transcribes", "transcribed" and "transcribable". Hence, if the NOI is a 
coding sequence, then the product of its expression may also be called the 
transcription product and visa versa. Likewise, if the NOI is an anti-sense nucleotide 
sequence then the product of its transcription may also be called the expression 
20 product and visa versa. 

In the following commentary references to "nucleotide sequence of the present 
invention" include references to the "promoter of the present invention" and vice 
versa. Also, the term "nucleotide sequence of the present invention" is synonymous 
25 with the phrase "polynucleotide sequence of the present invention". 

Other aspects concerning the promoter and/or the nucleotide sequence of the present 
invention include: a construct comprising the sequences of the present invention; a 
vector comprising the sequences of the present invention; a plasmid comprising the 

30 sequences of present invention; a transformed cell comprising the sequences of the 
present invention; a transformed tissue comprising the sequences of the present 
invention; a transformed organ comprising the sequences of the present invention; a 
transformed host comprising the sequences of the present invention; a transformed 
organism comprising the sequences of the present invention. The present invention 

35 also encompasses methods of expressing NOIs using the same, such as expression in 
a host plant cell; including methods for transferring same. 



1. 
2. 

10 3. 
4. 

5. 
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For convenience, the promoter of the present invention is sometimes referred to as 
the RSus3 promoter (or even the Rsus3 promoter or even the RSus3 promoter). 
However, it is important to note that the promoter of the present invention is not 
disclosed in the teachings of Huang et al (ibid). Moreover, and contrary to the authors 
5 statement that "the gene organisation patterns of RSus2 and RSus3 were the same" 
we found that the promoter of RSus3 was quite different to that of RSus2. 

In this respect, we found a very low degree of homology (i.e. identity) between the 
RSus3 promoter and the RSus2 promoter and also the RSus3 and RSusI promoter - 
10 as evidenced by the following Table. 



Homology scores of RSusI, RSus2, RSus3 



Alignment 


% Homology 1 


% Homology 2 


RSus3 and RSusI 


7.7 % 


5.4 % 


RSus3 and RSus2 


4.6 % 


5.4% 


RSusI and RSus2 


4.6 % 


8.8 % 



15 homology scores based on multiple alignment (all of the promoter region upstream 
of the translational start codon including the intron: RSusI (2663bp), RSus2 
(2900bp), RSus3 (2667bp)). 

2 Homology scores based on multiple alignment {RSusI, RSus2 t RSus3 following 
20 excision of intron 1). 

The homology scores presented in the above table show the low degree of similarity 
between the promoter regions of RSusI , RSus2 and RSus3. 

25 The percentage homologies were calculated using the multiple alignment feature in 
DNASIS™ (Hitachi Software), based on an algorithm, analogous to CLUSTAL 
(Higgins DG & Sharp PM (1988), Gene 73(1), 237-244). 

In addition, and as a result of a sequence analysis of the sequences of the RSus3 
30 promoter and the RSusI promoter and the RSus2 promoter, we found that apart from 
the conserved TATA box and intron splice sites, they have no motifs in common. 



SUBSTITUTE SHEET (RULE 26) 



Best Available Copy 



WO 00/78975 PCT/GB00/0264 1 

6 

Thus, contrary to the prior art teachings, we found that the gene organisation patterns 
of RSus2 and RSus3 are not the same. 

For ease of reference, aspects of the present invention are now discussed under 
5 appropriate section headings. However, the teachings under each section are not 
necessarily limited to each particular section. 

DETAILED ASPECTS OF THE PRESENT INVENTION 

10 According to a first aspect of the present invention there is provided a promoter 
having the nucleotide sequence presented as SEQ ID No. 1, or a variant, 
homologue, fragment or derivative thereof. 

Alternatively expressed, the present invention provides a nucleotide sequence 
15 selected from: 



(a) the nucleotide sequence presented as SEQ ID No. 1; 

(b) a nucleotide sequence that is a variant, homologue, derivative or fragment of 
the nucleotide sequence presented as SEQ ID No. 1; 

20 (c) a nucleotide sequence that is the complement of the nucleotide sequence set 
out in SEQ ID No. 1 ; - ^ 

(d) a nucleotide sequence that is the complement of a variant, homologue, 
derivative or fragment of the nucleotide sequence presented as SEQ ID No. 1; 

(e) a nucleotide sequence that is capable of hybridising to the nucleotide 
25 sequen ce set out in SEQ ID No. 1; 

(f) a nucleotide sequence that is capable of hybridising to a variant, homologue, 
derivative or fragment of the nucleotide sequence presented as SEQ ID No. .1 ; 

(g) a nucleotide sequence that is the complement of a nucleotide sequence that 
is capable of hybridising to the nucleotide sequence set out in SEQ ID No. 1; 

30 (h) a nucleotide sequence that is the complement of a nucleotide sequence that 
is capable of hybridising to a variant, homologue, derivative or fragment of the 
nucleotide sequence presented as SEQ ID No. 1; 

(i) a nucleotide sequence that is capable of hybridising to the complement of the 
nucleotide sequence set out in SEQ ID No. 1; 
35 (j) a nucleotide sequence that is capable of hybridising to the complement of a 
variant, homologue, derivative or fragment of the nucleotide sequence presented as 
SEQ ID No. 1; 
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(k) a nucleotide sequence comprising any one of (a), (b), (c), (d), (e) t (f), (g), (h), 
(i), and/or 0). 

In a preferred aspect, the promoter is obtainable from (though it does not have to be 
5 actually obtained from) a plant of the genus Oryza, preferably from Oryza sativa. 

Another aspect of the present invention includes an isolated nucleotide sequence 
according to the present invention. 

10 Additional aspects of the present invention include uses of the promoter for expressing 
NOIs in vitro (e.g. in culture media such as a broth) and/or in vivo (e.g. in a transformed 
organism). 

In a highly preferred aspect the present invention provides a method for expressing an 
15 NOI in endosperm (preferably selectively expressing in endosperm), the _method 
comprising expressing the NOI when it is operably linked to the promoter of the present 
invention. 

The terms "selective" and "selectively" as used herein with respect to the present 
20 invention are synonymous respectively with the terms "specific" and "specifically". In 
this respect, selective or specific expression by the promoter of the present invention in 
the endosperm layer means that higher levels of expression occur in the endosperm 
relative to other tissue types or cells. 

25 In some instances, the NOI is highly selectively expressed in the endosperm. In this 
respect, highly selective or specific expression by the promoter of the present invention 
in the endosperm layer means that expression occurs predominantly in the endosperm 
relative to other tissue types or cells and in some cases almost exclusively in the 
endosperm layer. 

30 

Preferably the transformed host/host cells is/are plant/plant cells. 

The plant can be any suitable monocot plant (such as maize) or any suitable dicot plant 
(such as soya or guar). 

35 

Preferably the plant is an endospermous cereal or a legume. 



SUBSTITUTE SHEET (RULE 26) 



Best Available Copy 



WO 00/78975 PCT/G BOO/02641 

8 

in a preferred aspect, the plant is a member of the grass family - such as any one of 
wheat, maize, barley, oats, rye, or rice. 

In an alternative preferred aspect, the plant is a legume - such as any one of guar or 
5 locus bean. 

Hence, preferably, the transformed host is a transformed member of the taxonomic 
groups Gramineae (which may be known as Poaceae) or Leguminosae (which may be 
known as Fabaceae) or is a cell or tissue thereof. 

10 

Hence, preferably, the transformed plant is a transformed grass or a transformed 
legume. 

Also, preferably, the transformed plant cell is a transformed grass cell or a transformed 
15 legume cell. 



The present invention is advantageous for a number of reasons. 

By way of example, the present invention is advantageous because desirable levels of 
20 the expression product of a NOI can be obtained. Here, the NOI expression product 
may be, for example, a desired compound of benefit to humans or animals, e.g. a 
desirable foodstuff or an enzyme having a beneficial effect, such as a foodstuff 
processing effect or even a pharmaceutical effect. Furthermore, that product may be 
easily retrievable. 

25 

Alternatively, the NOI expression product may affect metabolism within the host. In 
some cases, the NOI expression product may be a component that is essential for 
metabolism within the host. In these instances, it may not be important for the POI to 
be retrieved from the host. In some cases, it may be important that the POI is not 
30 retrieved from the host. 

The present invention is also advantageous because it allows transformed plants or 
plant cells to express desirable levels of the product of expression of a NOI. 

35 The present invention is also advantageous because it allows transformed plants or 
plant cells to express in selective tissues or cell types desirable levels of the expression 
product of a NOI. 
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The promoter of the present invention is further advantageous as it can provide good 
expression levels of a NOI under conditions of transient expression in plant cells. 

5 In a preferred aspect, the promoter is linked to the sequence presented as SEQ ID No. 
. .2, or a variant, homologue, derivative or fragment thereof. The term "linked" includes 
direct or indirect (such as with the provision of suitable spacer sequence(s)) linkages. 

Preferably the sequence presented as SEQ ID No. 2, or a variant, homologue, 
10 derivative or fragment thereof is located in between the promoter of the present 
invention and the NOI. 

In this respect, we have found that in some transformed hosts, such as transformed 
plants (such as transformed guar), expression levels of the NOI can be elevated. This 
15 is im portan t if it is desirable t o have such a n elevated expression. 



The promoter may be used in conjunction with one or more other expression elements 
- which may be alternatively called "functional elements". These additional expression 
elements may be linked to the promoter of the present invention. The term "linked" 
20 includes direct or indirect (such as with the .provision of suitable spacer sequence(s)) 
linkages. ~ ^ 

The additional expression element may enhance expression or inhibit undesirable 
expression. 

25 

The additional expression element may even be a promoter, wherein that promoter 
may be the same as or different to the promoter of the present invention, or even a part 
thereof. 

30 The present invention also encompasses repeating units of promoters - such as 
tandem repeats that comprise at least two promoter elements - one of which will be the 
promoter of the present invention - such as three promoter elements. 

Preferably the additional expression elements are located intermediate the promoter of 
35 the present invention and the NOI. Should it be desirable, the additional expression 
elements and the promoter of the present invention may be separated by suitable 
restriction sites. 
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In this respect, we have found that in some transformed hosts, such as transformed 
plants (such as transformed guar), expression levels of the NOI can be elevated. This 
is important if it is desirable to have such an elevated expression. 

5 

In one aspect, the NOI is an antisense nucleotide sequence. 

In this respect, preferably the NOI is a sequence that is antisense to all or part of the 
gene encoding an epimerase, in particular a UDP galactose epimerase, more in 
10 particular a UDP galactose-4-epimerase (EC 5.1.3.3). 

If the NOI is a sequence that is antisense to all or part of the gene encoding a UDP 
galactose epimerase - more in particular a UDP galactose-4-epimerase (EC 5.1.3.3) - 
then expression of that antisense sequence by the promoter of the present invention 
15 could affect the galactose units (such as causing a decrease thereof) on a 
galactomannan or other extracellular polysaccharides. 

In an alternative aspect, the NOI is a sense nucleotide sequence. 

20 Here, the NOI may be a sequence that constitutes all or part of the gene encoding a 
UDP galactose epimerase, more in particular a UDP galactose^-epimerase (EC 

— 5.1.3.3). 

PROMOTER 

25 

Thus, the present invention relates to a novel regulatory sequence, namely a 
promoter - which we have called the RSus3 promoter. 

The term "promoter is used in the normal sense of the art, e.g. an RNA polymerase 
30 binding site. 

The promoter may be the same as the naturally occurring form - for this aspect, 
preferably the promoter is not present in its natural environment. In addition, or in 
the alternative, the promoter is in an isolated and/or in a purified form. The promoter 
35 of the present invention can be a variant, homologue, fragment or derivative of the 
naturally occurring promoter. The promoter can be obtainable from or produced by 
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any suitable source, whether natural or not, or it may be synthetic, semi-synthetic or 
recombinant. 



A nucleotide sequence comprising the promoter of the present invention and other 
5 associated nucleotide sequences is schematically presented in Figure 1 (which is not 
to scale). 

In this respect, the promoter sequence SEQ ID No. 1 is shown as Box A. As can be 
seen, Box A comprises two units - which have been shown diagramatically as Box B 
10 and Box C. Box B corresponds to SEQ ID No. 6 and Box C corresponds to SEQ ID 
No. 4. SEQ ID No. 6 is the sequence up to the TATA box. SEQ ID No. 4 is the first 
part of an exon sequence. 

In accordance with the present invention we have found that expression may still be 
15 achieved in some host cells with Ihe promoter of the present invention in the a bse n ce 
of all or part of SEQ ID No. 4. In this respect, the promoter of the present invention 
would be represented as SEQ ID No. 6 and the comments concerning SEQ ID No. 1 
would equally apply to SEQ ID No. 6. 

20 Figure 1 also shows Box D - which corresponds to SEQ ID No. 2. SEQ ID No. 2 is 
an intron sequence. In accordance with the present invention we have found that 
surprisingly expression can still be achieved with the promoter of the present 
invention in the absence of all or part of SEQ ID No. 2. Thus, in one embodiment of 
the present invention the promoter of the present invention is not used in conjunction 

25 with all or part of SEQ ID No. 2. 

However, in some instances we have surprisingly found that SEQ ID No. 2 can 
elevate expression levels of an NOI and/or increase the selectivity of the expression. 
Hence, in an alternative preferred embodiment of the present invention the promoter 
30 of the present invention is used in conjunction with all or part of SEQ ID No. 2. 

Figure 1 also shows Box E - which corresponds to SEQ ID No. 5. SEQ ID No. 5 is 
the second part of the exon sequence associated with SEQ ID No. C. For some 
applications, the promoter sequence of the present invention is contained within a 
35 nucleotide sequence wherein SEQ ID No. 4 is fused to SEQ ID No. 5. 
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Thus, schematically, and by way of example, the promoter of the present invention 
may be represented as any one or more of: Box A; Box B with or without Box C; Box 
A with Box D; Box B - with or without Box C - with Box D; Box A with Box E; Box B - 
with or without Box C - with Box E; Box A with Box D and Box E; Box B - with or 
5 without Box C - with Box D and Box E. 

It is to understood that: Box A represents SEQ ID No. 1 or a variant, homologue, 
fragment or derivative thereof, preferably, Box A represents SEQ ID No. 1; Box B 
represents SEQ ID No. 6 or a variant, homologue, fragment or derivative thereof, 
preferably, Box B represents SEQ ID No. 6; Box C represents SEQ ID No. 4 or a 
variant, homologue, fragment or derivative thereof, preferably, Box C represents 
SEQ ID No. 4; Box D represents SEQ ID No. 2 or a variant, homologue. fragment or 
derivative thereof, preferably, Box D represents SEQ ID No. 2; Box E represents 
SEQ ID No. 5 or a variant, homologue, fragment or derivative thereof, preferably, 
Box E represents SEQ ID No. 5. 



In our analysis of the promoter of the present invention we have identified a number 
of interesting expression elements/functional elements which resemble consensus 
sequences or parts thereof. 

20 

These identified sequences are presented in Table 1 below. 



10 



15 
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TABLE 1 



Functional 


Posi 


Posi 


RSus3 


Consensus 


Reference 


elements in 


tion 


tion 


sequence 1 


sequence 


(Consensus 


the RSus3 




(- 






sequence) 


promoter 




intr 








region 




on) 








Translation 


+ 1 


+ 1 


CAATGG 


CAATGG 


[Joshi 1987] 


start site 












Intron 1 


-27 




TCCAG | GC 


tgcagIgt 


[Simpson & 


(Acceptor 










Filipowicz 


splice site) 










1996] , 












consensus for 












mono cots 


intron 1 


-892 




AG | GTAGAG 


AG | GTAAGT 




(Donor splice 












site) 












TATA box 


-986 


-121 


TATAAATA 


TATATATA 


[Joshi 1987] 


CAAT box 


-999 


-134 


GCACATTTT 


GGNCAATCT 




GCN4 box 


_ 


-167 


GTGAGGCAG 


(G/A) TGA(G/C)TC 


[Muller & 




1032 






A (T/G) 


Knudsen 1993] , 












cis- element 












involved in 












endosperm 












specificity 


Endosperm 




-207 


AGTAAAG 


TG (T/ C/A) AAA (G/ 


[Marzabal et 


boxes 


1072 






A) 


al . 1998] , cis- 












element 












involved in 












endosperm 












specificity 






-265 


TGCAAAC 








1130 












_ 


-484 


TGTCAAA 








1349 










Legumin boxes 


_ 


-674 


CGTGCATG 


CATGCATG 


[Baumlein et 


(RY repeats) 


1539 








al. 1992] 












cis-element 












involved in 












seed 












specificity 






-721 


CATGTATG 








1586 














-759 


CATGCATA 








1624 














-842 


CATGCATG CAT 








1707 




G 












CATGCCTG 








2115 


1250 














CAGGCATGCAT 








2480 


1615 


C 







Deviations from the consensus sequence are in cursive 

We currently believe that at least one or more, such as all, of those identified 
sequences that are the same as or similar to the consensus sequences should be 
present in the promoter sequence of the present invention. 
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As indicated above, the promoter can additionally include or be used with features to 
ensure or to increase expression in a suitable host. For example, the features can be 
conserved regions such as a Pribnow Box or a TATA box. 

5 The promoter may even contain or be used with other sequences to affect (such as to 
maintain, enhance, decrease) the levels of expression of the nucleotide sequence of 
the present invention. For example, suitable other sequences may include the SM- 
intron or an ADH intron. Other sequences may include inducible elements - such as 
temperature, chemical, light or stress inducible elements. Also, suitable elements to 
10 enhance transcription or translation may be present. An example of the latter element 
is the TMV 5* signal sequence (see Sleat Gene 217 [1987] 217-225; and Dawson Plant 
Mol. Biol. 23 [1993] 97). 

The promoter of the present invention may be used in combination with one or more 
15 other expression elements or functional elements or regulatory elements. 



The terms "expression elements*, "functional elements" and "regulatory elements" 
include enhancers, expression regulation signals, secretion leader sequences, 
promoter consensus sequences, terminator sequences, and may even include other 
20 promoters. 

The present invention also encompasses hybrid promoters that comprise at least a 
part of the promoter of the present invention and at least a part of another promoter. 

25 Hybrid promoters may also be used to improve inducible regulation of the expression 
construct. 

As indicated above, the present invention also encompasses tandem promoters 
wherein at least one of the promoters is the promoter of the present invention. If the 
30 other promoter is a promoter of the present invention or part thereof then that 
combination may be called a tandem repeat. 

The other promoter may even be another promoter or part thereof. By way of 
example, the other promoter may be selected for its efficiency in directing the 
35 expression of the NOI in the desired expression host. 
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In one embodiment, a constitutive promoter may be selected to direct the expression 
of the NOI. Such an expression construct may provide additional advantages since it 
may circumvent the need to culture the expression hosts on a medium containing an 
inducing substrate. Examples of strong constitutive and/or inducible promoters which 
5 are preferred for use in, for example, fungal expression hosts are those which are 
obtainable from the fungal genes for xylanase (xlnA), phytase, ATP-synthetase, 
subunit 9 (o//C), triose phosphate isomerase (tpi) t alcohol dehydrogenase (AdhA), 
a-amylase (amy), amyloglucosidase (AG - from the glaA gene), acetamidase (amdS) 
and glyceraldehyde-3-phosphate dehydrogenase (gpd) promoters. Examples of 

10 strong yeast promoters are those obtainable from the genes for alcohol 
dehydrogenase, 3-phosphoglycerate kinase and triosephosphate isomerase. 
Examples of strong bacterial promoters are the a-amylase and SP02 promoters as 
well as promoters from extracellular protease genes. Examples of strong plant 
promoters are the CaMV promoter and the SV40 35S promoter and the NOS 

15 promoter. 

For some applications, preferably the promoter is stably incorporated within the 
transformed organism's genome. 

20 The term "transformed" is synonymous with the term "transgenic". 

In a preferred aspect, the prbmoter is linked to the sequence presented as SEQ ID No. 
2, or a variant, homologue, derivative or fragment thereof. 

25 NATURALLY OCCURRING 

As used herein "naturally occurring" refers to the promoter sequence found in nature 
- i.e. the wild type promoter. 
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ISOLATED/PURIFIED 

As used herein, the terms "isolated" and "purified" refer to nucleic acid sequences, 
that are removed from their natural environment and isolated or separated from at 
5 least one other component with which they are naturally associated. 

BIOLOGICALLY ACTIVE 

As used herein "biologically active" refers to a promoter according to the present 
10 invention - such as a recombinant promoter - having a similar structural function (but 
not necessarily to the same degree), and/or similar regulatory function (but not 
necessarily to the same degree), and/or similar biochemical function (but not 
necessarily to the same degree) of the naturally occurring promoter. Specifically, a 
promoter of the present invention has the ability to express a NOI in endosperm, 
15 preferably selectively in endosperm. 

DELETION 

As used herein, a "deletion" is defined as a change in the nucleotide sequence in 
20 which one or more nucleotides are absent. 

INSERTION/ADDITION 

As used herein, an "insertion" or "addition" is a change in the nucleotide sequence 
25 which has resulted in the addition of one or more nucleotides as compared to the 
naturally occurring promoter. 

SUBSTITUTION 

30 As used herein, "substitution" results from the replacement of one or more 
nucleotides or by one or more different nucleotides. 

VARIANT/HOMOLOGUE 

35 The terms "variant" or "homologue" with respect to the nucleotide sequence of the 
present invention are synonymous with allelic variations of the sequences. 
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In particular, the term "homology" as used herein may be equated with the term 
"identity". Here, sequence homology with respect to the nucleotide sequence of the 
present invention can be determined by a simple "eyeball" comparison (i.e. a strict 
comparison) of any one or more of the sequences with another sequence to see if 
5 that other sequence has at least 75% identity to the sequence(s). Relative sequence 
homology (i.e. sequence identity) can also be determined by commercially available 
computer programs that can calculate % homology between two or more sequences. 
A typical example of such a computer program is CLUSTAL. 

10 Sequence homology (or identity) may even be determined using any suitable 
homology algorithm, using for example default parameters. Advantageously, the 
BLAST algorithm is employed, with parameters set to default values. The BU\ST 
algorithm is described in detail at http://www.ncbi.nih.gov/BLAST/blast_help.html, 
which is incorporated herein by reference. The search parameters are defined as 

J5 follows, and are advantageously set to the def ined . def a ult parameters. 

Advantageously, "substantial homology" when assessed by BLAST equates to 
sequences which match with an EXPECT value of at least about 7, preferably at 
least about 9 and most preferably 10 or more. The default threshold for EXPECT in 
20 BLAST searching is usually 10. 

BLAST (Basic Local Alignment Search Tool) is the heuristic search algorithm 
employed by the programs biastp, blastn, blastx, tblastn, and tblastx; these programs 
ascribe significance to their findings using the statistical methods of Karlin and 

25 Altschul (see http://www.ncbi.nih.gov/BLAST/blast_help.html) with a few 
enhancements. The BLAST programs were tailored for sequence similarity 
searching, for example to identify homologues to a query sequence. The programs 
are not generally useful for motif-style searching. For a discussion of basic issues in 
similarity searching of sequence databases, see Altschul et al (1994) Nature 

30 Genetics 6: 1 1 9-1 29. 

The five BLAST programs available at http://www.ncbi.nlm.nih.gov perform the 
following tasks: 

35 biastp compares an amino acid query sequence against a protein sequence 
database; 
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blastn compares a nucleotide query sequence against a nucleotide sequence 
database; 

blastx compares the six-frame conceptual translation products of a nucleotide query 
5 sequence (both strands) against a protein sequence database; 

tblastn compares a protein query sequence against a nucleotide sequence database 
dynamically translated in all six reading frames (both strands). 

10 tblastx compares the six-frame translations of a nucleotide query sequence against 
the six-frame translations of a nucleotide sequence database. 

BLj^ST uses the following search parameters: 

!? HISTOGRAM Display ...a .Jnfagg&m.MjEQt£S. for each search; default is yes. (See 
parameter H in the BLAST Manual). 

DESCRIPTIONS Restricts the number of short descriptions of matching sequences 
reported to the number specified; default limit is 100 descriptions. (See parameter V 
20 in the manual page). See also EXPECT and-CUTOFF. . ... 

ALIGNMENTS Restricts database sequences to the number specified for which high- 
scoring segment pairs (HSPs) are reported; the default limit is 50. If more database 
sequences than this happen to satisfy the statistical significance threshold for 
25 reporti ng (see EX PECT and CUTOFF below), only the matches ascribed the greatest 
statistical significance are reported. (See parameter B in the BLAST Manual). 

EXPECT The statistical significance threshold for reporting matches against 
database sequences; the default value is 10, such that 10 matches are expected to 
30 be found merely by chance, according to the stochastic model of Karlin and Altschul 
(1990). If the statistical significance ascribed to a match is greater than the EXPECT 
threshold, the match will not be reported. Lower EXPECT thresholds are more 
stringent, leading to fewer chance matches being reported. Fractional values are 
acceptable. (See parameter E in the BLAST Manual). 

' '35 

CUTOFF Cutoff score for reporting high-scoring segment pairs. The default value is 
calculated from the EXPECT value (see above). HSPs are reported for a database 
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sequence only if the statistical significance ascribed to them is at least as high as 
would be ascribed to a lone HSP having a score equal to the CUTOFF value. Higher 
CUTOFF values are more stringent, leading to fewer chance matches being 
reported. (See parameter's in the BLAST Manual). Typically, significance thresholds 
5 can be more intuitively managed using EXPECT. 

MATRIX Specify an alternate scoring matrix for BLASTP, BLASTX, TBLASTN and 
TBLASTX. The default matrix is BLOSUM62 (Henikoff & Henikoff, 1992). The valid 
alternative choices include: PAM40, PAM120, PAM250 and IDENTITY. No alternate 
io scoring matrices are available for BLASTN; specifying the MATRIX directive in 
BLASTN requests returns an error response. 

STRAND Restrict a TBLASTN search to just the top or bottom strand of the 
database sequences; or restrict a BLASTN, BLASTX or TBLASTX search to just 
15 reading frames on the top or bottom strand of the query sequence. 

FILTER Mask off segments of the query sequence that have low compositional 
complexity, as determined by the SEG program of Wootton & Federhen (1993) 
Computers and Chemistry 17:149-163, or segments consisting of short-periodicity 

20 internal repeats, as determined by the XNU program of Claverie & States (1993) 
Computers and Chemistry 17:191-201, or, for BLASTN, by the DUST program of 
Tatusov and Lipman (see http://www.ncbi.nlm.nih.gov). Filtering can eliminate 
statistically significant but biologically uninteresting reports from the blast output 
(e.g., hits against common acidic-, basic- or proline-rich regions), leaving the more 

25 biologically interesting regions of the query sequence available for specific matching 
against database sequences. 

Low complexity sequence found by a filter program is substituted using the letter "N" 
in nucleotide sequence (e.g., "NNNNNNNNNNNNN") and the letter "X" in protein 
30 sequences (e.g., "XXXXXXXXX"). 

Filtering is only applied to the query sequence (or its translation products), not to 
database sequences. Default filtering is DUST for BLASTN, SEG for other programs. 

35 It is not unusual for nothing at all to be masked by SEG, XNU, or both, when applied 
to sequences in SWISS-PROT, so filtering should not be expected to always yield an 
effect. Furthermore, in some cases, sequences are masked in their entirety, 
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indicating that the statistical significance of any matches reported against the 
unfiltered query sequence should be suspect. 

NCBI-gi Causes NCBI gi identifiers to be shown in the output, in addition to the 
5 accession and/or locus name. 

Preferably, sequence comparisons are conducted using the simple BLAST search 
algorithm provided at http://www.ncbi.nlm.nih.gov/BLAST. 

10 Other computer program methods to determine identify and similarity between the 
two sequences include but are not limited to the GCG program package (Devereux 
et al 1984 Nucleic Acids Research 12: 387 and FASTA (Atschul et af 1990 J Molec 
Biol 403-410). 

15 ^ ^e u sed vyhen^^d identity, then preferably the 

following parameters are used: 



FOR BLAST 




GAP OPEN 


0 


GAP EXTENSION 


0 




FOR CLUSTAL 


DNA 


WORD SIZE 


2 


GAP PENALTY 


10 


GAP EXTENSION 


0.1 



20 Most preferably, sequence comparisons are conducted using DNASIS™ 

As used herein, the terms "variant", "homologue", "fragment" and "derivative** embrace 
allelic variations of the sequences. 

25 The term "variant" also encompasses sequences that are complementary to 
sequences that are capable of hydridising to the nucleotide sequences presented 
herein. 
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HYBRIDISATION 

The term "hybridisation" (sometimes written as "hybridization") as used herein shall 
include "the process by which a strand of nucleic acid joins with a complementary 
5 strand through base pairing" (Coombs J (1994) Dictionary of Biotechnology, Stockton 
Press, New York NY) as well as the process of amplification as carried out in 
polymerase chain reaction (PCR) technologies as described in Dieffenbach CW and 
GS Dveksler (1995, PCR Primer, a Laboratory Manual, Cold Spring Harbor Press, 
Plainview NY). 

10 

Hybridization conditions are based on the melting temperature (Tm) of the nucleic 
acid binding complex, as taught in Berger and Kimmel (1987, Guide to Molecular 
Cloning Techniques, Methods in Enzymology, Vol 152, Academic Press, San Diego 
CA), and confer a defined "stringency" as explained below. 

Stringency of hybridisation refers to conditions under which polynucleicacids hybrids 
are stable. Such conditions are evident to those of ordinary skill in the field. As 
known to those of skill in the art, the stability of hybrids is reflected in the melting 
temperature (Tm) of the hybrid which decreases approximately 1 to 1.5°C with every 
20 1% decrease in sequence homology. In general, the stability of a hybrid is a function 
of sodium ion concentration and temperature. Typically, the hybridisation reaction is 
performed under conditions of higher stringency, followed by washes of varying 
stringency. 

25 As use d herein, h igh stringency refers to conditions that permit hybridisation of only 
those nucleic acid sequences that form stable hybrids in 1 M Na+ at 65-68 °C 

Maximum stringency typically occurs at about Tm-5°C (5°C below the Tm of the 
probe). 

30 

High stringency typically occurs at about 5°C to 10°C below the Tm of the probe. 
High stringency conditions can be provided, for example, by hybridisation in an 
aqueous solution containing 6x SSC, 5x Denhardfs, 1 % SDS (sodium dodecyl 
sulphate), 0.1 Na+ pyrophosphate and 0.1 mg/ml denatured salmon sperm DNA as 
35 non specific competitor. Following hybridisation, high stringency washing may be 
done in several steps, with a final wash (about 30 min) at the hybridisation 
temperature in 0.2 - 0.1x SSC, 0.1 % SDS. 
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Moderate, or intermediate, stringency typically occurs at about 10°C to 20°C below 
the Tm of the probe. 

5 Low stringency typically occurs at about 20°C to 25°C below the Tm of the probe. 

As will be understood by those of skill in the art, a maximum stringency hybridization 
can be used to identify or detect identical polynucleotide sequences, while an 
intermediate (or low) stringency hybridization can be used to identify or detect similar 
10 or related polynucleotide sequences. 

Moderate stringency refers to conditions equivalent to hybridisation in the above 
described solution but at about 60-62°C. In that case the final wash is performed at 
the hybridisation temperature in 1x SSC, 0.1 % SDS. 



Low stringency refers to conditions equivalent to hybridisation in the above described 
solution at about 50-52°C. In that case, the final wash is performed at the 
hybridisation temperature in 2x SSC, 0.1 % SDS. 

20 It is understood that these conditions may be adapted and duplicated using a variety 
of buffers, e.g. formamide-based buffers, and temperatures. Denhardt's solution and 
SSC are well known to those of skill in the art as are other suitable hybridisation 
buffers (see, e.g. Sambrook, et a/., eds. (1989) Molecular Cloning: A Laboratory 
Manual, Cold Spring Harbor Laboratory Press, New York or Ausubel, et a/., eds. 

25 (1990) Current Protocols in Molecular Biology, John Wiley & Sons, Inc.). Optimal 
hybridisation conditions have to be determined empirically, as the length and the GC 
content of the probe also play a role. 

NUCLEOTIDE SEQUENCE 

30 

The term "nucleotide sequence" as used herein refers to an oligonucleotide 
sequence or polynucleotide sequence, and variants, homologues, fragments and 
derivatives thereof (such as portions thereof). The nucleotide sequence may be of 
genomic or synthetic or recombinant origin which may be double-stranded or single- 
35 stranded whether representing the sense or antisense strand. 

Preferably, the term "nucleotide sequence" means DNA. 
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In a preferred embodiment, the nucleotide sequence per se of the present invention 
does not cover the native nucleotide sequence according to the present invention in its 
natural environment when it is linked to its naturally associated sequence(s) that is/are 
5 also in its/their natural environment. For ease of reference, we shall call this preferred 
embodiment the "non-native nucleotide sequence". 

Typically, the nucleotide sequence of the present invention is prepared using 
recombinant DNA techniques (i.e. recombinant DNA). However, in an alternative 
10 embodiment of the invention, the nucleotide sequence could be synthesized, in 
whole or in part, using chemical methods well known in the art (see Caruthers MH et 
al (1980) Nuc Acids Res Symp Ser 215-23, Horn T et al (1980) Nuc Acids Res Symp 
Ser 225-232). 

1 5 The present inve ntio n also encompasses nucleotide sequences tha t are 
complementary to the sequences presented herein, or any derivative, fragment or 
derivative thereof. If the sequence is complementary to a fragment thereof then that 
sequence can be used as a probe to identify similar sequences in other organisms etc. 

20 The present invention also encompasses nucleotide sequences that are capable of 
hybridising to the sequences presented herein, or any derivative, fragment or derivative 
thereof. 

The present invention also encompasses nucleotide sequences that are capable of 
25 hybridising to the sequences that are complementary to the sequences presented 
herein, or any derivative, fragment or derivative thereof. 

The term "variant" also encompasses sequences that are complementary to 
sequences that are capable of hydridising to the nucleotide sequences presented 
30 herein. 

Preferably, the term "variant" encompasses sequences that are complementary to 
sequences that are capable of hydridising under stringent conditions (e.g. 65°C and 
O.lxSSC {1xSSC = 0.15 M NaCI, 0.015 Na 3 citrate pH 7.0}) to the nucleotide 
35 sequences presented herein. 
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The present invention also relates to nucleotide sequences that can hybridise to the 
nucleotide sequences of the present invention (including complementary sequences of 
those presented herein). 

5 The present invention also relates to nucleotide sequences that are complementary to 
sequences that can hybridise to the nucleotide sequences of the present invention 
(including complementary sequences of those presented herein). 

Also included within the scope of the present invention are polynucleotide sequences 
10 that are capable of hybridizing to the nucleotide sequences presented herein under 
conditions of intermediate to maximal stringency. 

In a preferred aspect, the present invention covers nucleotide sequences that can 
hybridise to the nucleotide sequence of the present invention under stringent conditions . 
15 (e.g. 65°C and CMxSSC). 



Advantageously, the invention provides nucleic acid sequences which are capable of 
hybridising, under stringent conditions, to a fragment of SEQ. ID. No. 1. Preferably, 
the fragment is between 15 and 50 bases in length. Advantageously, it is about 25 
20 bases in length. 

— By knowledge of the nucleic acid sequences set out herein it is possible to devise 
partial and full-length nucleic acid sequences such as cDNA and/or genomic clones. 
Nucleic acid sequences obtained by PCR - such as fragments of the full length 

25 sequence, preferably fragments having unique sequences - may then be used to 
obtain same or similar sequences using hybridization library screening techniques. 
The fragments may be from 10 to 100 nucleotides long. Preferably, the fragments 
may be from 1 5 to 90 nucleotides long. More preferably, the fragments may be from 
20 to 80 nucleotides long. 

30 

By way of example, a PCR clone may be labelled with radioactive atoms and used to 
screen a genomic library from other species, preferably other plant species. 
Hybridization conditions will typically be conditions of medium to high stringency (for 
example 0.03M sodium chloride and 0.03M sodium citrate at from about 50°C to 
35 about 60°C). 
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Degenerate nucleic acid probes encoding all or part of the amino acid sequence may 
also be used to probe cDNA and/or genomic libraries from other species, preferably 
other plant species or fungal species. However, it is preferred to carry out PCR 
techniques initially to obtain a single sequence for use in further screening 
5 procedures. 

Polynucleotide sequences of the present invention obtained using the techniques 
described above may be used to obtain further homologous sequences and variants 
using the techniques described above. 

10 

Thus, polynucleotides of the present invention may be used to produce a primer, e.g. 
a PCR primer, a primer for an alternative amplification reaction, a probe e.g. labelled 
with a revealing label by conventional means using radioactive or non-radioactive 
labels, or the polynucleotides may be cloned into vectors. Such primers, probes and 
15 other fragments will be at least 15, preferably at least 20, for example at least 25, 30 
or 40 nucleotides in length, and are also encompassed by the term polynucleotides 
of the present invention as used herein. 

Polynucleotides or primers of the present invention may carry a revealing label. 

20 Suitable labels include radioisotopes such as 32 P or 35 S, enzyme labels, or other 
labels such as biotin or digoxigenin. The DIG™ system (Boehringer Mannheim) is 
useful as it offers a very attractive non-radioactive system. The DIG system is based 
on the steroid hapten digoxigenin, which occurs exclusively in Digitalis plants and 
thus avoids endogenous background problems as in the case of other haptens, such 

25 as biotin} Such labels may be added to polynucleotides or primers of the present 
invention and may be detected using by techniques known perse. 

Polynucleotides such as a DNA polynucleotide and primers according to the present 
invention may be produced recombinantiy, synthetically, or by any means available to 
30 those of skill in the art. They may also be cloned by standard techniques. In 
general, primers will be produced by synthetic means, involving a step wise 
manufacture of the desired nucleic acid sequence one nucleotide at a time. 
Techniques for accomplishing this using automated techniques are readily available 
in the art. 

35 

Longer polynucleotides will generally be produced using recombinant means, for 
example using PCR cloning techniques. This will involve making a pair of primers 
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(e.g. of about 15-30 nucleotides) to a region of the nucleotide sequence which it is 
desired to clone, bringing the primers into contact with mRNA or cDNA obtained from 
a fungal, plant or prokaryotic cell, performing a polymerase chain reaction under 
conditions which bring about amplification of the desired region, isolating the 
5 amplified fragment (e.g. by purifying the reaction mixture on an agarose gel) and 
recovering the amplified DNA. The primers may be designed to contain suitable 
restriction enzyme recognition sites, so that the amplified DNA can be cloned into a 
suitable cloning vector. 

I0 The nucleotide sequence of the present invention may be engineered in order to alter 
its activity for a number of reasons, including but not limited to, alterations which 
modify the processing and/or expression. For example, mutations may be introduced 
using techniques which are well known in the art, e.g., site-directed mutagenesis to 
insert new restriction sites, to alter giycosylation patterns or to change codon 

15 preference. By_vray^Lfyrth.er example, the nucleotide sequence of the present 
invention may also be modified to optimise expression in a particular host cell - such 
as the inclusion of additional promoter(s) or parts thereof, such as the provision of 
tandem repeats, and/or the provision of other expression elements such as the intron 
sequence. Other sequence changes may be desired in order to introduce restriction 
20 enzyme recognition sites. 

VARIANT, HOMOLOGUE AND FRAGMENT OF THE PROMOTER NUCLEOTIDE 
SEQUENCE 

25 The terms "variant", "homologue" or "fragment" in relation to the nucleotide sequence 
of the present invention include any substitution of, variation of, modification of, 
replacement of, deletion of or addition of one (or more) nucleic acid from or to the 
sequence providing the resultant nucleotide sequence has the ability to act as a 
promoter, preferably being at least as biologically active as the promoter having the 

30 sequence shown as SEQ ID No. 1. In particular, the term "homologue" covers 
homology with respect to structure and/or function providing the resultant nucleotide 
sequence has the ability to act as a promoter. With respect to sequence homology, 
preferably there is at least 75%, more preferably at least 85%, more preferably at least 
90% homology to the sequence shown as SEQ ID No. 1. More preferably there is at 

35 least 95%, more preferably at least 98%, homology to the sequence shown as SEQ ID 
No. 1. 
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The present invention also relates to DNA segments comprising the DNA sequence of 
SEQ ID No. 1 or allelic variations of such sequences. These segments are capable of 
acting as a regulatory region/unit. 



5 VARIANT, HOMOLOGUE AND FRAGMENT OF NUCLEOTIDE SEQUENCE SEQ ID 
NO. 2 

The terms "variant", "homologue" or "fragment" in relation to the nucleotide sequence 
of the present invention include any substitution of, variation of, modification of, 

10 replacement of, deletion of or addition of one (or more) nucleic acid from or to the 
sequence providing the resultant nucleotide sequence has the ability to increase 
expression levels, preferably being at least as biologically active as the sequence 
shown as SEQ ID No. 2. In particular, the term "homologue" covers homology with 
respect to structure and/or function providing the resultant nucleotide sequence has the 

is ability. tQjncre„ase. jg^F^jonJsyisds. VV)tb.te5pj3i^Jo..s.^u9ACiB! homology, preferably 

there is at least 75%, more preferably at least 85%, more preferably at least 90% 
homology to the sequence shown as SEQ ID No. 2. More preferably there is at least 
95%, more preferably at least 98%, homology to the sequence shown as SEQ ID No. 
2. 

20 

The present invention also relates to DNA segments comprising the DNA sequence of 
SEQ ID No. 2 or allelic variations of such sequences. These segments are capable of 
increasing expression levels. 

25 VARIANT, HOMOLOGUE AND FRAGMENT OF ANY ONE OR MORE OF THE 
IDENTIFIED NUCLEOTIDE SEQUENCES IN TABLE 1 

The terms "variant", "homologue" or "fragment" in relation to any one of the nucleotide 
sequences presented in Table 1 include any substitution of, variation of, modification 

30 of, replacement of, deletion of or addition of one (or more) nucleic acid from or to the 
sequence providing the resultant nucleotide sequence has the ability to influence 
expression. In particular, the term "homologue" covers homology with respect to 
structure and/or function providing the resultant nucleotide sequence has the ability to 
influence expression. With respect to sequence homology, preferably there is at least 

35 75%, more preferably at least 85%, more preferably at least 90% homology to the 
respective sequence shown in Table 1. More preferably there is at least 95%, more 
preferably at leas|98%, homology to the respective sequence shown in Table 1. 
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NOI/POI 

In a preferred aspect, the present invention relates to the use of the promoter of the 
5 present invention to express one or more suitable NOIs. 

Thus, in a preferred aspect, the promoter is operably linked to a NOI. 

The term "operably linked" refers to a relationship - such as in a suitable juxtaposition 
to - wherein the components described are in a relationship permitting them to function 
in their intended manner. A regulatory sequence "operably linked" to a NOI is ligated 
in such a way that expression of the coding sequence is achieved under condition 
compatible with the control sequences. 

15 The NOI can encode for a POI. 



The NOI can be any suitable sequence encoding a polypeptide of interest, other than 
the complete natural sequence normally associated with the promoter of the present 
invention when the NOI is operably linked to the promoter in their natural environment. 

20 

Preferably, the NOI does not include all of the nucleotide sequence naturally 
associated with the wild type promoter of the present invention. 

The NOI can be any nucleotide sequence that is either foreign (heterologous) or 
25 natural (homologous) to the organism in question - which may be a filamentous fungus 
or a plant. 

Typically, the POI may be any suitable prokaryotic or eukaryotic heterologous or 
homologous peptide or protein of interest. 

30 

For some applications, the POI may be secreted and/or retrieved. 
For some applications, it is important that the POI is secreted and/or retrieved. 
35 For some applications, the POI may not secreted and/or retrieved. 

For some applications, it is important that the POI is not secreted and/or retrieved. 
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The NOI may even be a sequence which is capable of expressing a nucleic acid, for 
example a regulatory RNA such as an antisense RNA or a ribozyme, an mRNA, or a 
tRNA or rRNA capable of regulating the metabolism of an organism. 

5 

The NOI may also be a homologous nucleotide sequence which has been mutated, 
such as by insertion, addition, deletion or alteration, such that it is no longer identical 
with the natural homologous nucleotide sequence. 

10 The POI can be a single-chain polypeptide molecule as well as a multiple-polypeptide 
complex where individual constituent polypeptides are linked by covalent or non- 
covalent means. Here, the term "polypeptide" includes peptides of two or more 
amino acids in length, typically having more than 5, or more than 10 or more than 20 
amino acids. 

15 

Typical examples of a NOI include sequences coding for proteins and enzymes that 
modify metabolic and catabolic processes. The heterologous nucleotide sequence 
may code for an agent for introducing or increasing pathogen resistance. The 
heterologous nucleotide sequence may be an antisense construct for modifying the 
20 expression of natural transcripts present in the relevant tissues. 

The heterologous nucleotide sequence may be a protein giving nutritional value to a 
food or crop. Typical examples include plant proteins that can aid human or animal 
digestion, inhibit the formation of anti-nutritive factors and those plant proteins that 
25 have a more desirable amino acid composition (e.g. a higher lysine content than a non- 
transgenic plant). 

Non-limiting examples of POIs include, for example, proteins involved in the regulation 
of cell division, for example growth factors including neurotrophic growth factors, 
30 cytokines (such as a- p- or y-interferon, interleukins including IL-1, IL-2, tumour 
necrosis factor, or insulin-like growth factors I or II), protein kinases (such as MAP 
kinase), protein phosphatases and cellular receptors for any of the above. 

The POI may also be an enzyme involved in cellular metabolic pathways, for 
35 example enzymes involved in carbohydrate biosynthesis or degradation, amino acid 
biosynthesis or degradation (such as tyrosine hydroxylase), purine or pyrimidine 
biosynthesis or degradation, and the biosynthesis or degradation of 
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neurotransmitters, such as dopamine, or a protein involved in the regulation of such 
pathways, for example protein kinases and phosphatases. 

The POI may also be effective in the post-harvest processing of plants - e.g. in the 
5 brewing or baking processes. 

The POI may also be a transcription factors or proteins involved in their regulation, 
for example pocket proteins of the Rb family such as Rb or p107, membrane 
proteins, structural proteins or heat shock proteins such as hsp70.. 

10 

The NOI may code for an intron of a particular nucleotide sequence, wherein the intron 
can be in sense or antisense orientation. 

Non-limiting examples of POIs include: proteins or enzymes involved in starch 
15 metabolism, proteins or enzymes involved in glycogen metabolism, acetyl esterases, 
aminopeptidases, amylases, arabinases, arabinofuranosidases, carboxypeptidases, 
catalases, cellulases, chitinases, chymosin, cutinase, deoxyribonucleases, epimerases, 
esterases, ct-galactosidases, p-galactosidases, a-glucanases, glucan lysases, endo- 
3-glucanases, glucoamylases, glucose oxidases, a-glucosidases, (3-glucosidases, 
20 glucuronidases, hemicellulases, hexose oxidases, hydrolases, invertases, isomerases, 
laccases, lipases, lyases, fnannosidases, oxidases, oxidoreductases, pectate lyases, 
pectin acetyl esterases, pectin depolymerases, pectin methyl esterases, pectinolytic 
enzymes, peroxidases, phenoloxidases, phytases, polygalacturonases, proteases, 
rhamno-galacturonases, ribonucleases, thaumatin, transferases, transport proteins, 
25 transgl utaminases , xylanases, or combinations thereof. The NOI may even be an 
antisense sequence for any of those sequences. 

The NOI can be the nucleotide sequence coding for the exo-amylase enzyme which is 
the subject of PCT patent application PCT/IB99/00649 (incorporated herein by 
30 reference). 

The NOI can be the nucleotide sequence coding for the xylanase enzymes and 
mutants thereof which are the subject of UK patent application GB 99078057 
(incorporated herein by reference). 

35 
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The NOI can be the nucleotide sequence coding for the arabinofuranosidase enzyme 
which is the subject of PCT patent application PCT/EP96/01009 (incorporated herein 
by reference). 

5 The NOI can be any of the nucleotide sequences coding for the ADP-glucose 
pyrophosphorylase enzymes which are the subject of PCT patent application 
PCT/EP94/01082 (incorporated herein by reference). 



10 



15 



20 



The NOI can be any of the nucleotide sequences coding for the a-glucan lyase enzyme 
which are described in PCT patent application PCT/EP94/03397 (incorporated herein 
by reference). 

The NOI can be any of the sequences coding for T. lanuginosus amylase, as described 
in PCT patent application PCT/EP95/02607, incorporated herein by reference. 

The NOI can be any of the nucleotide sequences coding for the glucanase enzyme 
which are described in PCT patent application PCT/EP96/01008 (incorporated herein 
by reference). 

The NOI can be any of the nucleotide sequences coding for the UDP-galactose 
epimerase enzyme, as well as anti-sense sequences therefor, - such as those which 
are described in PCT patent application WO-A-98/54335 (incorporated herein by 
reference). 

25 The NOI can be hox from the red algae Chondrus crispus or HpA from Aspergillus 
niger. 

The POI can be a PME as disclosed in WO-A-97/03574 or the PME disclosed in either 
WO-A-94/25575 or WO-A-97/31 102 as well as variants, derivatives or homologues of 
30 the sequences disclosed in those patent applications. 

The POI may even be a fusion protein, for example to aid in extraction and 
purification. 

35 Examples of fusion protein partners include the maltose binding protein, glutathione- 
s-transferase (GST). 6xHis. GAL4 (DNA binding and/or transcriptional activation 



SUBSTITUTE SHEET (RULE 26) 

3NSDOCID: <WO 0078975A2_I_> 



Best Available Copy 



WO 00/78975 PCT/G BOO/02641 

32 

domains) and p-galactosidase. It may also be convenient to include a proteolytic 
cleavage site between the fusion components. 

The POI may even be fused to a secretion sequence. Examples of secretion leader 
5 sequences are those originating from the amyloglucosidase gene, the a-factor gene, 
the a-amylase gene, the lipase A gene, the xylanase A gene. 

Other sequences can also facilitate secretion or increase the yield of secreted POI. 
Such sequences could code for chaperone proteins as for example the product of 
10 Aspergillus nigercyp B gene described in UK patent application 9821198.0. 

The NOI may be engineered in order to alter their activity for a number of reasons, 
including but not limited to, alterations which modify the processing and/or 
expression of the expression product thereof. For example, mutations may be 

15 introduced using techniques which are well known in ..the art. e.g., site-directed 

mutagenesis to insert new restriction sites, to alter glycosylation patterns or to 
change codon preference. By way of further example, the NOI may also be modified 
to optimise expression in a particular host cell. Other sequence changes may be 
desired in order to introduce restriction enzyme recognition sites. 

20 

The NOI may include within it synthetic or modified nucleotides. A number of 
different types of modification to oligonucleotides are known in the art. These include 
methylphosphonate and phosphorothioate backbones, addition of acridine or 
polylysine chains at the 3' and/or 5* ends of the molecule. For the purposes of the 
25 present invention, it is to be understood that the NOI may be modified by any method 
available in the art. Such modifications may be carried out in to enhance the in vivo 
activity or life span of the NOI. 

The NOI may be modified to increase intracellular stability and half-life. Possible 
30 modifications include, but are not limited to, the addition of flanking sequences of the 
5' and/or 3" ends of the molecule or the use of phosphorothioate or 2' O-methyl rather 
than phosphodiesterase linkages within the backbone of the molecule. 

CONSTRUCTS 

35 

The term "construct 0 - which is synonymous with terms such as "conjugate 11 , "cassette" 
and "hybrid" - includes the nucleotide sequence according to the present invention 
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directly or indirectly attached to a NOI. An example of an indirect attachment is the 
provision of a suitable spacer group such as an intron sequence, such as the Sht- 
intron or the ADH intron, intermediate the promoter and the NOI. The same is true for 
the term "fused" in relation to the present invention which includes direct or indirect 
5 attachment. In each case, the terms do not cover the natural combination of the wild 
type gene promoter and its associated nucleotide sequence when they are both in their 
natural environment. 

The construct may even contain or express a marker which allows for the selection of 
10 the genetic construct in, for example, a plant cell into which it has been transferred. 

The selectable marker means may reside on an additional vector or may be included 
in the nucleic acid molecule which contains the expression system. The nature of the 
selectable marker means will depend on the nature of the host and the culture 
[ 5 condlt ' c>n ?; I he m _ ost suitable selection syst^ are 
those formed by the group of selection markers which do not require a mutation in 
the host organism. 

US-A-5358864 provides a short list of suitable selectable marker genes that may be 
20 used in the present invention - examples of .which include fungal selection markers 
such as those that are the genes for acetamidase (amdS), ATP synthetase, subunit 
9 (oliC) and benomyl resistance (benA). 

Examples of non-fungal selection markers are the bacterial G418 resistance gene 
25 (this ma y also be used in yeast, but not in fungi), the ampicillin resistance gene (E. 
co//), the neomycin resistance gene (Bacillus) and the E.coli uidA gene, coding for 
p-glucuronidase (GUS). 

In certain aspects of the present invention, use of the ble marker - which confers 
30 resistance to phleomycin/bleomycin/zeocin - may be preferred. However, other 
selection markers, known in the art, could be used. Examples of auxotrophic 
markers are pyrG selecting for uridine prototrophs, argB selecting for arginine 
prototrophs, niaD selecting for nitrate prototrophs, trpC selecting for tryptophan 
prototrophs, amdS selecting for increased utilisation of acetamide as sole nitrogen 
35 source. Dominant resistance markers could be chosen from otlC3 conferring 
resistance to oligomycin, hph conferring resistance to hygromycin B, bar conferring 
resistance to bialaphos or NPTll conferring resistance to G418. 
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The construct may even contain or express a marker which allows for the selection of 
the genetic construct in, for example, a bacterium, preferably of the genus Bacillus, 
such as Bacillus subtilis, or plants, such as potatoes, sugar beet etc., into which it has 
5 been transferred. 

The construct may even contain or express a marker which allows for the selection of 
the genetic construct in, for example, a plant. Various markers exist which may be 
used, such as for example those encoding mannose 6-phosphate isomerase, 
10 glucosamine 6-phosphate deaminase/ketoisomerase, xylose isomerase, or those 
markers that provide for antibiotic resistance - e.g. resistance to G418, hygromycin, 
bleomycin, kanamycin and gentamycin. 

VECTORS 

15 

The term "vector" includes expression vectors, replicable vectors, trahsformatiori 
vectors and shuttle vectors, including vector combinations thereof. 

The term "expression vector" means a construct capable of in vivo or in vitro 
20 expression. 

Preferably the expression vector is incorporated in the genome of the organism. The 
term "incorporated" preferably covers stable incorporation into the genome. 

25 Preferably, the vector of the present invention comprises a construct according to the 
present invention. Alternatively expressed, preferably the promoter of the invention 
is present in a vector and wherein the promoter is operably linked to a NOI such that 
the promoter is capable of providing for the expression of the coding sequence by a 
suitable host organism, i.e. the vector is an expression vector. 

30 

The term "transformation vector" means a construct capable of being transferred from 
one entity to another entity - which may be of the species or may be of a different 
species. If the construct is capable of being transferred from one species to another - 
such as from an E.coli plasmid to a bacterium, such as of the genus Bacillus, then the 
35 transformation vector is sometimes called a "shuttle vector". It may even be a 
construct capable of being transferred from an E.coli plasmid to an Agrobacterium to a 
plant. 
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The vectors of the present invention may be transformed into a suitable host cell to 
provide for expression of a POI. 



10 



15 



Thus, in a further aspect the invention provides a process for preparing a POI 
invention which comprises cultivating a host cell transformed or transfected with an 
expression vector under conditions to provide for expression by the promoter of the 
present invention of a NOI ..encoding the POI, and optionally recovering the 
expressed POL 

Typically, the nucleotide sequences of the invention may be incorporated into a 
recombinant replicable vector. The vector may be used to replicate the nucleic acid 
in a compatible host organism. 



Thus ' in 3 further embodirTlen ^ the invention provides a method of introducing a 
nucleotide sequence of the invention into a replicable vector, introducing the vector 
into a compatible host organism, and growing the host organism under conditions 
which bring about expression of a NOI by the promoter of the present invention. The 
POI may then be recovered from the host organism. Suitable host organisms include 
20 plants or plant cells. 



The NOI may be incorporated into a replicable vector, for example a cloning or 
expression vector, which comprises a promoter of the present invention. The vector 
may be used to replicate the NOI in a compatible host cell. 

25 

Thus, in a further embodiment, the invention provides a method of making NOIs by 
introducing a NOI into a replicable vector which comprises the promoter of the 
present invention, introducing the vector into a compatible host cell, and growing the 
host cell under conditions which bring about replication of the vector. The vector may 
30 be recovered from the host cell. 

The vectors may be for example, plasmid, virus or phage vectors. In addition to the 
promoter of the present invention, the vectors may be provided with any one or more 
of an origin of replication, a NOI operably linked to the promoter for the expression of 
35 the NOI, and a regulator of the promoter. 

Vectors may be used, for example, to transfect or transform a host organism either in 
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Such vectors may be transformed or transfected into a suitable host organism to 
provide for expression of a protein of the invention. This process may comprise 
5 culturing a host organism transformed with an expression vector as described above 
under conditions to provide for expression by the vector of a coding sequence 
encoding the protein, and optionally recovering the expressed protein. 

Vectors of the invention may be introduced into host organisms for the purpose of 
10 replicating the vectors/nucleotide sequences and/or expressing the NOI. In one 
preferred aspect, the host organism is a plant cell. 

Vectors of the present invention may introduced into suitable host organisms using a 
variety of techniques known in the art, such as transfection, transformation and 
15 electroporation. Another technique is the protoplast transformation method (Winer et 
a/., Microbiology, 1985, 468, American Society for Microbiology). 

Vectors may be used in vitro, for example for the production of RNA or used to 
transfect or transform a host cell. 

20 

TISSUE 

The term "tissue" as used herein includes tissue per se and organ. 
25 HOST CELLS 

The term "host cell" - in relation to the present invention includes any cell that could 
comprise the promoter of the present invention and/or products obtained therefrom, 
wherein the promoter can allow expression of a NOl when present in the host cell. 

30 

Thus, a further embodiment of the present invention provides host cells transformed 
or transfected with a . promoter of the present invention. Preferably the promoter is 
carried in a vector for the replication and expression of NOIs. The cells will be 
chosen to be compatible with the said vector and may for example be prokaryotic (for 
35 example bacterial), fungal, yeast or plant cells. 

Preferably, the host cell is a plant cell. 
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The gram-negative bacterium E. coli is widely used as a host for heterologous gene 
expression. However, large amounts of heterologous protein tend to accumulate 
inside the cell. Subsequent purification of the desired protein from the bulk of E.coli 
5 intracellular proteins can sometimes be difficult. 



In contrast to E.coli, bacteria from the genus Bacillus are very suitable as 
heterologous hosts because of their capability to secrete proteins into the culture 
medium. Other bacteria suitable as hosts are those from the genera Streptomyces 
10 and Pseudomonas. 



Depending on the nature of the NOI encoding the POI, and/or the desirability for 
further processing of the expressed protein, eukaryotic hosts such as yeasts or fungi 
may be preferred. In general, yeast cells are preferred over fungal cells because 
the Y are ^nipulate. However, some proteins are either poorly secreted 

from the yeast cell, or in some cases are not processed properly (e.g. 
hyperglycosylation in yeast). In these instances, a fungal host organism should be 
selected. 



20 Examples of suitable expression hosts within the scope of the present invention are 
fungi such as Aspergillus species (such as those described in EP-A-01 84438 and 
EP-A-0284603) and Trichoderma species; bacteria such as Bacillus species (such as 
those described in EP-A-01 34048 and EP-A-0253455), Streptomyces species and 
Pseudomonas species; and yeasts such as Kluyveromyces species (such as those 

25 described in EP-A-0096430 and EP-A-0301670) and Saccharomyces species. By 
way of example, typical expression hosts may be selected from Pichia pastoris, 
Hansenufa polymorpha t Aspergillus niger, Aspergillus niger var. tubigenis, 
Aspergillus niger var. awamori, Aspergillus aculeatis, Aspergillus nidulans, 
Aspergillus orvzae, Trichoderma reese/\ Bacillus subtilis, Bacillus licheniformis, 

30 Bacillus amyloliquefaciens, Kluyveromyces lactis and Saccharomyces cerevisiae. 

The use of suitable host cells - such as yeast, fungal and plant host cells - may 
provide for post-translational modifications (e.g. myristolation, glycosylate, 
truncation, lapidation and tyrosine, serine or threonine phosphorylation) as may be 
35 needed to confer optimal biological activity on recombinant expression products of 
the present invention. 
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Thus, the present invention also provides a method of transforming a host cell with a 
nucleotide sequence shown as SEQ ID No. 1 or a derivative, homologue, variant or 
fragment thereof. 

5 Host cells transformed with a promoter according to the present invention operably 
linked to a NOI may be cultured under conditions suitable for the expression and 
recovery of the POI from cell culture. The POI may be secreted or may be contained 
intracellular^ depending on the sequence and/or the vector used. As will be 
understood by those of skill in the art, expression vectors containing a promoter 

10 according to the present invention operably linked to a NOI can be designed with 
signal sequences which direct secretion of the NOI/POI through a particular 
prokaryotic or eukaryotic cell membrane. Other recombinant constructions may join 
the NOI to a nucleotide sequence encoding a polypeptide domain which will facilitate 
purification of soluble proteins (Kroll DJ et al (1993) DNA Cell Biol 12:441-53). 

15 

ORGANISM 

The term "organism" in relation to the present invention includes any organism that 
could comprise the nucleotide sequence of the present invention, wherein the promoter 
20 can allow expression of a NOI when present in the organism. Examples of organisms 
may include a fungus, yeast or a plant. 

The term "transgenic organism" in relation to the present invention includes any 
organism that comprises the nucleotide sequence of the present invention, wherein the 
25 promoter can allow expression of a NOI within the organism. Preferably the nucleotide 
sequence is incorporated in the genome of the organism. 

The term "transgenic organism" does not cover the native nucleotide sequence 
according to the present invention in its natural environment when it is operably linked 
30 to its associated coding sequence which is also in its natural environment. 

Therefore, the transgenic organism of the present invention includes an organism 
comprising any one of, or combinations of, the nucleotide sequence of the present 
invention, constructs according to the present invention (including combinations 
35 thereof), vectors according to the present invention, plasmids according to the present 
invention, cells according to the present invention, tissues according to the present 
invention or the products thereof. The transformed cell or organism could prepare 
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acceptable quantities of the POL In some instances, the POI may be easily retrievable 
from, the cell or organism. 

TRANSFORMATION OF HOST CELLS/HOST ORGANISMS 

5 

As indicated earlier, the host organism can be a prokaryotic or a eukaryotic organism. 
Examples of suitable prokaryotic hosts include £. coli and Bacillus subtilis. Teachings 
on the transformation of prokaryotic hosts is well documented in the art, for example 
see Sambrook et al (Molecular Cloning: A Laboratory Manual, 2nd edition, 1989, Cold 
10 Spring Harbor Laboratory Press) and Ausubel et a/.. Current Protocols in Molecular 
Biology (1995), John Wiley & Sons, Inc. 

If a prokaryotic host is used then the NOI may need to be suitably modified before 
transformation - such as by removal of introns. 

15 

TRANSFORMED FUNGUS 

A host organism may be a fungus - such as a mold. Examples of suitable such hosts 
include any member belonging to the genera Thermomyces, Acremonium, 
20 Aspergillus, Penicillium, Mucor, Neurospora, Trichoderma and the like - such as 
Thermomyces lanuginosis, Acremonium chrysogenum, Aspergillus niger x Aspergillus 
oryzae, Aspergillus awamori, Penicil/inum chrysogenem, Mucor javanious, 
Neurospora crassa, Trichoderma viridae and the like. 

25 In one embodiment, the host organism may be a filamentous fungus. 

For almost a century, filamentous fungi have been widely used in many types of 
industry for the production of organic compounds and enzymes. For example, 
traditional Japanese koji and soy fermentations have used Aspergillus sp. Also, in this 
30 century Aspergillus niger has been used for production of organic acids particular citric 
acid and for production of various enzymes for use in industry. 

There are two major reasons why filamentous fungi have been so widely used in 
industry. First filamentous fungi can produce high amounts of extracellular products, 
35 for example enzymes and organic compounds such as antibiotics or organic acids. 
Second filamentous fungi can grow on low cost substrates such as grains, bran, beet 
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pulp etc. The same reasons have made filamentous fungi attractive organisms as 
hosts for heterologous expression according to the present invention. 



In order to prepare the transgenic Aspergillus, expression constructs are prepared by 
5 inserting the nucleotide sequence according to the present invention (and optionally the 
NOI) into a construct designed for expression in filamentous fungi. 

Several types of constructs used for heterologous expression have been developed. 
These constructs preferably contain one or more of: a signal sequence which directs 
10 the POI to be secreted, typically being of fungal origin, and a terminator (typically being 
active in fungi) which ends the expression system. 

Another type of expression system has been developed in fungi where the nucleotide 
sequence according to the present invention (and optionally the NOI) can be fused to a 

15 smaller or a larger p a rt of a [fungal gene jencqdjng_a_ stable protein. This can stabilise 
the POI. In such a system a cleavage site, recognised by a specific protease, can be 
introduced between the fungal protein and the POI, so the produced fusion protein can 
be cleaved at this position by the specific protease thus liberating the POI. By way of 
example, one can introduce a site which is recognised by a KEX-2 like peptidase found 

20 in at least some AspergillL Such a fusion leads to cleavage in vivo resulting in 
production of the expressed product and not a larger fusion protein. 

Heterologous expression in Aspergillus has been reported for several genes coding for 
bacterial, fungal, vertebrate and plant proteins. The proteins can be deposited 
25 intracellular^ if the nucleotide sequence according to the present invention (or even the 
NOI) is not fused to a signal sequence. Such proteins will accumulate in the cytoplasm 
and will usually not be glycosylated which can be an advantage for some bacterial 
proteins. If the nucleotide sequence according to the present invention (or even the 
NOI) is equipped with a signal sequence the protein will accumulate extracellularly. 

30 

With regard to product stability and host strain modifications, some heterologous 
proteins are not very stable when they are secreted into the culture fluid of fungi. Most 
fungi produce several extracellular proteases which degrade heterologous proteins. To 
avoid this problem special fungal strains with reduced protease production have been 
35 used as host for heterologous production. 
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Teachings on transforming filamentous fungi are reviewed in US-A-5741665 which 
states that standard techniques for transformation of filamentous fungi and culturing 
the fungi are well known in the art. An extensive review of techniques as applied to 
A/, crassa is found, for example in Davis and de Serres, Methods Enzymol (1971) 
5 17A:79-143. Standard procedures are generally used for the maintenance of strains 
and the preparation of conidia. Mycelia are typically grown in liquid cultures for about 
14 hours (25°C), as described in Lambowitz et a/., J Cell Biol (1979) 82:17-31. Host 
strains can generally be grown in either VogePs or Fries minimal medium 
supplemented with the appropriate nutrient(s), such as, for example, any one or 
10 more of: his, arg, phe, tyr, trp, p-aminobenzoic acid, and inositol. 

Further teachings on transforming filamentous fungi are reviewed in US-A-5674707 
which states that once a construct has been obtained, it can be introduced either in 
linear form or in plasmid form, e.g., in a pUC-based or other vector, into a selected 

J_5 filamentous fungal host using a technique such as DNA-mediated transformation, 

electroporation, particle gun bombardment, protoplast fusion and the like. In 
addition, Ballance 1991 (ibid) states that transformation protocols for preparing 
transformed fungi are based on preparation of protoplasts and introduction of DNA into 
the protoplasts using PEG and Ca 2 * ions. The transformed protoplasts then regenera- 

20 te and the transformed fungi are selected using various selective markers. 



To allow for selection of the resulting transformants, the transformation typically also 
involves a selectable gene marker which is introduced with the expression cassette, 
either on the same vector or by co-transformation, into a host strain in which the 

25 gene marker is selectable. Various marker/host systems are available, including the 
pyrG, argB and niaD genes for use with auxotrophic strains of Aspergillus nidulans; 
pyrG and argB genes for Aspergillus oryzae auxotrophs; pyrG, trpC and niaD genes 
for Penicillium chrysogenum auxotrophs; and the argB gene for Trichoderma reesei 
auxotrophs. Dominant selectable markers including amdS, oliC, hyg and phleo are 

30 also now available for use with such filamentous fungi as A. niger, A. oryzae, A. 
ficuum t P. chrysogenum, Cephalosporium acremonium, Cochliobolus 
heterostrophus, Giomerella cingulata, Fulvia fulva and Leptosphaeria maculans (for a 
review see Ward in Modern Microbial Genetics, 1991, Wiley-Liss, Inc., at pages 
455-495). A commonly used transformation marker is the amdS gene of A. nidulans 

35 which in high copy number allows the fungus to grow with acrylamide as the sole 
nitrogen source. 
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For the transformation of filamentous fungi, several transformation protocols have been 
developed for many filamentous. Among the markers used for transformation are a 
number of auxotrophic markers such as argB, trpC, niaD and pyrG, antibiotic 
resistance markers such as benomyl resistance, hygromycin resistance and 
5 phleomycin resistance. 

In one aspect, the host organism can be of the genus Aspergillus, such as Aspergillus 
niger. 

10 A transgenic Aspergillus according to the present invention can also be prepared by 
following the teachings of Rambosek, J. and Leach, J. 1987 (Recombinant DNA in 
filamentous fungi: Progress and Prospects. CRC Crit. Rev. Biotechnol. 6:357-393), 
Davis R.W. 1994 (Heterologous gene expression and protein secretion in Aspergillus. 
In: Martinelli S.D., Kinghorn J.R.( Editors) Aspergillus: 50 years on. Progress in 

15 industrial microbiology vol 29. Elsevier Amsterdam 1994. pp 525-560), Ballance, D.J. 
1991 (Transformation systems for Filamentous Fungi and an Overview of Fungal Gene 
structure. In: Leong, S.A., Berka R.M. (Editors) Molecular Industrial Mycology. Systems 
and Applications for Filamentous Fungi. Marcel Dekker Inc. New York 1991. pp 1-29) 
and Turner G. 1994 (Vectors for genetic manipulation. In: Martinelli S.D., Kinghorn 

20 J.R.( Editors) Aspergillus: 50 years on. Progress in industrial microbiology vol 29. 
Elsevier Amsterdam 1994. pp. 641-666)r ^ 

TRANSFORMED YEAST 

25 In another embodiment the transgenic organism can be a yeast. 



In this regard, yeast have also been widely used as a vehicle for heterologous gene 
expression. 

30 By way of example, the species Saccharomyces cerevisiae has a long history of 
industrial use, including its use for heterologous gene expression. Expression of 
heterologous genes in Saccharomyces cerevisiae has been reviewed by Goodey et al 
(1987, Yeast Biotechnology, D R Berry et a/ t eds, pp 401-429, Allen and Unwin, 
London) and by King et al (1989, Molecular and Cell Biology of Yeasts, E F Walton and 

35 G T Yarronton, eds, pp 107-133, Blackie, Glasgow). 
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For several reasons Saccharomyces cerevisiae is well suited for heterologous gene 
expression. First, it is non-pathogenic to humans and it is incapable of producing 
certain endotoxins. Second, it has a long history of safe use following centuries of 
commercial exploitation for various purposes. This has led to wide public acceptability. 
5 Third, the extensive commercial use and research devoted to the organism has 
resulted in a wealth of knowledge about the genetics and physiology as well as large- 
scale fermentation characteristics of Saccharomyces cerevisiae. 

A review of the principles of heterologous gene expression in Saccharomyces 
10 cerevisiae and secretion of gene products is given by E Hinchcliffe E Kenny (1993, 
"Yeast as a vehicle for the expression of heterologous genes", Yeasts, Vol 5, Anthony 
H Rose and J Stuart Harrison, eds, 2nd edition, Academic Press Ltd.). 

Several types of yeast vectors are available, including integrative vectors, which require 
15 recombination with the host genome for their maintenance, and autonomously 
replicating plasmid vectors. 

In order to prepare the transgenic Saccharomyces, expression constructs are prepared 
by inserting the nucleotide sequence of the present invention into a construct designed 
20 for expression in yeast. Several types of constructs used for heterologous expression 
have been developed. The constructs may contain a promoter active in yeast, such as 
a promoter of yeast origin, such as the GAL1 promoter, is used. Usually a signal 
sequence of yeast origin, such as the sequence encoding the SUC2 signal peptide, is 
used. A terminator active in yeast ends the expression system. 

25 

For the transformation of yeast several transformation protocols have been developed. 
For example, a transgenic Saccharomyces according to the present invention can be 
prepared by following the teachings of Hinnen et al (1978, Proceedings of the National 
Academy of Sciences of the USA 75, 1929); Beggs, J D (1978, Nature, London, 275, 
30 104); and Ito, H et al (1983, J Bacteriology 153, 163-168). 

The transformed yeast cells may be selected using various selective markers. Among 
the markers used for transformation are a number of auxotrophic markers such as 
LEU2, HIS4 and TRP1, and dominant antibiotic resistance markers such as 
35 aminoglycoside antibiotic markers, eg G418. 
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TRANSFORMED PLANTS/PLANT CELLS 

A preferred host organism suitable for the present invention is a plant. 

5 In this respect, the basic principle in the construction of genetically modified plants is to 
insert genetic information in the plant genome so as to obtain a stable maintenance of 
the inserted genetic material. 

Several techniques exist for inserting the genetic information, the two main principles 
10 being direct introduction of the genetic information and introduction of the genetic 
information by use of a vector system. A review of the general techniques may be 
found in articles by Potrykus (Annu Rev Plant Physiol Plant Mol Biol [1991] 42:205-225) 
and Christou (Agro-Food-Industry Hi-Tech March/April 1994 17-27). 

1 5 Even though the promoter of the present invention is not disclosed in EP-B-0470145 
and CA-A-2006454, those two documents do provide some useful background 
commentary on the types of techniques that may be employed to prepare transgenic 
plants according to the present invention. Some of these background teachings are 
now included in the following commentary. 

20 

The basic principle in the construction of genetically modified plants is to insert genetic 
information in the plant genome so as to obtain a stable maintenance of the inserted 
genetic material. 

25 Thus, in one aspect, the present invention relates to a vector system which carries a 
nucleotide sequence or construct according to the present invention and which is 
capable of introducing the nucleotide sequence or construct into the genome of an 
organism, such as a plant. 

30 The vector system may comprise one vector, but it can comprise two vectors. In the 
case of two vectors, the vector system is normally referred to as a binary vector 
system. Binary vector systems are described in further detail in Gynheung An et a/. 
(1980), Binary Vectors, Plant Molecular Biology Manual A3. 1-19. 

35 One extensively employed system for transformation of plant cells with a given 
promoter or nucleotide sequence or construct is based on the use of a Ti plasmid from 
Agrobacterium tumefaclens or a Ri plasmid from Agrobacterium rhizogenes An et al 
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(1986), Plant Physiol. 81, 301-305 and Butcher D.N. et at. (1980), Tissue Culture 
Methods for Plant Pathologists, eds.: D.S. Ingrams and J.P. Helgeson, 203-208. 

Several different Ti and Ri plasmids have been constructed which are suitable for the 
5 construction of the plant or plant cell constructs described above. A non-limiting 
example of such a Ti plasmid is pGV3850. 

The nucleotide sequence or construct of the present invention should preferably be 
inserted into the Ti-plasmid between the terminal sequences of the T-DNA or adjacent 
1 0 a T-DNA sequence so as to avoid disruption of the sequences immediately surrounding 
the T-DNA borders, as at least one of these regions appear to be essential for insertion 
of modified T-DNA into the plant genome. 

As will be understood from the above explanation, if the organism is a plant, then the 
1 5 vector system of the present invention is preferably one which contains the sequences 
necessary to infect the plant (e.g. the vir region) and at least one border part of a T- 
DNA sequence, the border part being located on the same vector as the genetic 
construct. Preferably, the vector system is an Agrobacterium tumefaciens Ti-plasmid 
or an Agrobacterium rhizogenes Ri-plasmid or a derivative thereof, as these plasmids 
20 are well-known and widely employed in theconstruction of transgenic plants, many 
vector systems exist which are based on these plasmids or derivatives thereof. 

In the construction of a transgenic plant the nucleotide sequence or construct of the 
present invention may be first constructed in a micro-organism in which the vector can 

25 replicat e and whic h is easy to manipulate before insertion into the plant. An example of 
a useful micro-organism is E. colL, but other micro-organisms having the above 
properties may be used. When a vector of a vector system as defined above has been 
constructed in E. coli it is transferred, if necessary, into a suitable Agrobacterium 
strain, e.g. Agrobacterium tumefaciens. The Ti-plasmid harbouring the nucleotide 

30 sequence or construct of the invention is thus preferably transferred into a suitable 
Agrobacterium strain, e.g. A. tumefaciens, so as to obtain an Agrobacterium cell 
harbouring the nucleotide sequence or construct of the invention, which DNA is 
subsequently transferred into the plant cell to be modified. 

35 As reported in CA-A-2006454, a large amount of cloning vectors are available which 
contain a replication system in E. coli and a marker which allows a selection of the 



SUBSTITUTE SHEET (RULE 26) 

BNSDOCJD: <WO 0078975A2_I_> 



Best Available Copy 



WO 00/78975 PCT/GB00/02641 

46 

transformed cells. The vectors contain for example pBR 322, the pUC series, the M13 
mp series, pACYC 184 etc. 



In this way, the nucleotide or construct of the present invention can be introduced into a 
5 suitable restriction position in the vector. The contained plasmid is used for the 
transformation in E.cofi. The E.coli cells are cultivated in a suitable nutrient medium 
and then harvested and lysed. The plasmid is then recovered. As a method of 
analysis there is generally used sequence analysis, restriction analysis, electrophoresis 
and further biochemical-molecular biological methods. After each manipulation, the 
10 used DNA sequence can be restricted and connected with the next DNA sequence. 
Each sequence can be cloned in the same or different plasmid. 

After each introduction method of the desired promoter or construct or nucleotide 
sequence according to the present invention in the plants the presence and/or insertion 

15 of further DNA sequences may be necessary. If, for example, for the transformation 
the Ti- or Ri-plasmid of the plant cells is used, at least the right boundary and often 
however the right and the left boundary of the Ti- and Ri-plasmid T-DNA, as flanking 
areas of the introduced genes, can be connected. The use of T-DNA for the 
transformation of plant cells has been intensively studied and is described in EP-A- 

20 120516; Hoekema, in: The Binary Plant Vector System Offset-drukkerij Kanters B.B., 
Alblasserdam, 1985, Chapter V; Fraley, et a/., Crit. Rev. Plant Sci., 4:1-46; and An et 
aL, EMBO J. (1985) 4:277-284. 

Direct infection of plant tissues by Agrobacterium is a simple technique which has been 
25 widely employed and which is described in Butcher D.N. et a/. (1980), Tissue Culture 
Methods for Plant Pathologists, eds.: D.S. Ingrams and J. P. Helgeson, 203-208. For 
further teachings on this topic see Potrykus (Annu Rev Plant Physiol Plant Mol Biol 
[1991] 42:205-225) and Christou (Agro-Food-Industry Hi-Tech March/April 1994 17- 
27). With this technique, infection of a plant may be done on a certain part or tissue of 
30 the plant, i.e. on a part of a leaf, a root, a stem or another part of the plant. 

Typically, with direct infection of plant tissues by Agrobacterium carrying the promoter 
and/or the GOI, a plant to be infected is wounded, e.g. by cutting the plant with a razor 
or puncturing the plant with a needle or rubbing the plant with an abrasive. The wound 
35 is then inoculated with the Agrobacterium. The inoculated plant or plant part is then 
grown on a suitable culture medium and allowed to develop into mature plants. 
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When plant cells are constructed, these cells may be grown and maintained in 
accordance with well-known tissue culturing methods such as by culturing the cells in a 
suitable culture medium supplied with the necessary growth factors such as amino 
acids, plant hormones, vitamins, etc. Regeneration of the transformed cells into 
5 genetically modified plants may be accomplished using known methods for the 
regeneration of plants from cell or tissue cultures, for example by selecting transformed 
shoots using an antibiotic and by subculturing the shoots on a medium containing the 
appropriate nutrients, plant hormones, etc. 

10 Other techniques for transforming plants include ballistic transformation, the silicon 
whisker carbide technique (see Frame BR t Drayton PR, Bagnaall SV, Lewnau CJ, 
Bullock WP, Wilson HM, Dunwell JM, Thompson JA & Wang K (1994) Production of 
fertile transgenic maize plants by silicon carbide whisker-mediated transformation, 
The Plant Journal 6: 941-948) and viral transformation techniques (e.g. see Meyer P, 

15 Heidmann I & Niedenhof I (1992) The use of cassava mosaic virus as a vector 
system for plants, Gene 110: 213-217). 

Further teachings on plant transformation may be found in EP-A-0449375. 

20 BALLISTIC TRANSFORMATION OF PLANTS AND PLANT TISSUE 

Originally developed to produce stable transformants of plant species which were 
recalcitrant to transformation by Agrobacterium tumefaciens, ballistic transformation of 
plant tissue, which introduces DNA into cells on the surface of metal particles, has 
25 found utility in testing the performance of genetic constructs during transient 
expression. In this way, gene expression can be studied in transiently transformed 
cells, without stable integration of the gene in interest, and thereby without time- 
consuming generation of stable transformants. 

30 In more detail, the ballistic transformation technique (otherwise known as the particle 
bombardment technique) was first described by Klein et al. [1987], Sanford et ai 
[1987] and Klein et a/. [1988] and has become widespread due to easy handling and 
the lack of pre-treatment of the cells or tissue in interest. 

35 The principle of the particle bombardment technique is direct delivery of DNA-coated 
micro-projectiles into intact plant cells by a driving force (e.g. electrical discharge or 
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compressed air). The micro-projectiles penetrate the cell wall and membrane, with 
only minor damage, and the transformed cells then express the promoter constructs. 

One particle bombardment technique that can be performed uses the Particle Inflow 
5 Gun (PIG), which was developed and described by Finer et a/. [1992] and Vain et a/. 
[1993]. The PIG accelerates the micro-projectiles in a stream of flowing helium, 
through a partial vacuum, into the plant cells. 

One of advantages of the PIG is that the acceleration of the micro-projectiles can be 
10 controlled by a timer-relay solenoid and by regulation the provided helium pressure. 
The use of pressurised helium as a driving force has the advantage of being inert, 
leaves no residues and gives reproducible acceleration. The vacuum reduces the 
drag on the particles and lessens tissue damage by dispersion of the helium gas 
prior to impact [Finer et a/. 1992]. 

15 

In some cases, the effectiveness and ease of the PIG system makes it a good choice 
for the generation of transient transformed guar tissue, which were tested for 
transient expression of promoter/reporter gene fusions. 



20 GUAR 

As indicated above, in one aspect, a preferred transformed organism is transformed 
guar. 

25 Guar (Cyamopsis tetragonoloba) is a drought-tolerant species, which originated in 
India and Pakistan, but is cultivated for industrial use in a number of other countries 
e.g. USA. Guar is taxonomicalty arranged with the peas and beans in the family of 
grain legumes. Legumes are dicotyledonous plants (dicots), which are characterised 
by having broad leaves and two cotyledons, in contrast to monocotyledonous plants 

30 (monocots) which display 'grass like 1 morphology and only have one. Examples of 
other dicots are tobacco and potato, and examples of monocots are cereals such as 
wheat, rice, maize and barley. Guar differs from most crop legumes such as peas 
and beans, however, in that its seed carbohydrate reserve is not starch in the 
cotyledons, but is galactomannan accumulated in the endosperm. The endosperm is 

35 a non-photosynthetic tissue, which envelopes the cotyledons in the seed immediately 
beneath the seed coat, and can be dissected away from the other seed tissues. 
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GUAR GUM 



Guar is grown as crop for animal feed and human consumption. However, an 
important use of guar is the extraction of guar gum or 'guaran', which is used as a 
5 functional ingredient in foods e.g. as thickener in ice cream. Guar gum consists of 
galactomannan - which is an extracellular cell-wall polysaccharide located in the 
endosperm. 

In a preferred aspect, the present invention relates to the use of the promoter of the 
1 0 present invention to cause expression of an NOI that can affect guar gum synthesis. 

GALACTOMANNAN 



15 



20 



Galactomannan consists of a mannan backbone with substituted galactosyl-groups, 
the bonding of which is (1^6)-a-D : galacto-(1^4)-p.p-mannan, and the polymer has 
an molecular weight of 220,000 daltons (Whistler & Hymowitz 1979). 

The degree of substitution, and thereby the ratio between galactose and mannose 
residues in galactomannan. varies between leguminous species. The ratio is approx. 
0.56 in guar, and mature seeds contains approx. 35-42 % galactomannan (Whistler 
& Hymowitz 1979). 



In a preferred aspect, the present invention relates to the use of the promoter of the 
present invention to cause expression of an NOI that can affect galactomannan 
25 synthesis. 

ENDOSPERM EXPRESSION 

In a preferred aspect, the promoter of the present invention is active in the 
30 endosperm of plant tissue. 

In this respect, endosperm - as well as the embryo - are formed early in seed 
development of flowering plants. In particular, the endosperm is formed during 
double fertilisation in which, one sperm nucleus fuses with an egg to produce the 
35 embryo, and a second sperm nucleus fuses with two polar nuclei to form the triploid 
endosperm [Lopes & Larkin 1993]. In some, but by no means all. plants this tissue 
serves as a storage reservoir for the seeds, and guar and the cereals are among 



iNSDOCID: <WO 0078975A2J_> 



SUBSTITUTE SHEET (RULE 26) 



Best Available Copy 



WO 00/78975 PCT/GBOO/02641 

50 

these. In other species, however, the cotyledons have assumed the status of the 
principal storage tissue of the seed, and the endosperm has become vestigial. 



In cereals such as rice, wheat, barley, and maize the chief carbohydrate reserve in 
5 the endosperm is starch which accumulates intracellular^ in plastids known as 
amyloplasts. In guar endosperm, however, the principal storage carbohydrate is 
galactomannan which accumulates extracellularly in the intercellular spaces, and is 
therefore secreted out of the cell. 

10 During maturation, the developing seed increases in volume and mass due to 
significant cell expansion and accumulation of carbohydrates, protein and lipids to be 
used as C and N sources during germination. After maturation, the seed enters 
dormancy [West & Harada 1993], imposed by plant hormones, such as abscisic acid 
(ABA), which prevent precocious germination [Thomas 1993]. On germination of 

1 5 guar seeds, mannanase, and galactosidase activities break down the galactomannan 
of the endosperm, and the mannose and galactose released support the 
development of the seedling. 



CARBOHYDRATE METABOLISM 

20 

In a preferred aspect, the present invention relates to the use of the promoter of the 
present invention to cause expression of an NOI that can affect carbohydrate 
metabolism, such as sucrose metabolism, particularly in a plant tissue. 

25 In this respect, sucrose is the major photoassimilate which is transported long 
distances in plants from the photosynthetic 'source' tissue to the heterotrophic 
energy-consuming 'sink' tissues, in which it is the mainstay of carbohydrate nutrition. 

The initial precursor for the synthesis of sucrose is triose-phosphate which is 
30 synthesised in the carbon-fixation (Calvin) cycle in chloroplast in photosynthetic cells. 
This triose phosphate (glyceraldehyde 3-phosphate) is transported to the cytoplasm 
and converted to fructose 6-phosphate (Fructose-6-P) and glucose 1 -phosphate 
(Glucose-1-P). The Glucose-1-P is converted to UDP-glucose which together with 
Fructose-6-P are converted to sucrose 6-phosphate (Sucrose-6-P) by sucrose 
35 phosphate synthase. S-6-P is the immediate precursor for sucrose and the 
conversion is catalysed by sucrose phosphatase. 
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After its synthesis in the cytoplasm of mesophyll cells, sucrose is loaded by the 
companion cells into the phloem and distributed to sink tissues. The transport 
process is a "mass flow' facilitated by osmotic pressure, due to a concentration 
gradient of sucrose between source and sink. Photosynthesis is slowed by build up 
5 of sucrose in green tissues, and so it is seen by many as a demand-led process, with 
the vascular tissue communicating requirement for carbohydrate by unloading of 
sucrose into developing sink tissues. 



10 



25 



30 



The developing seed is a storage-active sink organ, and is therefore a large user of 
sucrose. Developing seeds utilise sucrose for several purposes, one example of 
which is synthesis of storage polysaccharides as galactomannan in guar endosperm. 



Sucrose is degraded by sucrose synthase to fructose (F) and UDP-galactose 
(UDPG) and through a series of conversions GDP-mannose (GDPM) and UDPG are 
15 formed. GDPM and UDPG are the immediate precursors for galactomannan and the 
syndesis is catalysed By mannan synthase and galactosyl transferase. 
Galactomannan is synthesized and transported from the golgi apparatus to the 
extracellular space. Another key enzyme in the synthesis of galactomannan is UDP 
galactose 4-epimerase. 

20 

Breakdown of sucrose in plants can be achieved through the activity of two enzymes: 
- Invertase (E.C. 3.2. 1 .26) and sucrose synthase (E.C. 2.4. 1 . 1 3). The name 'sucrose 
synthase' implies that the enzyme catalyses the synthesis of sucrose, but the 
metabolic role of sucrose synthase is catabolic rather than anabolic, and sucrose 
synthase preferentially cleaves sucrose in vivo. Consistent with this role, sucrose 
synthase is abundant in sink tissues such as developing seeds, but not in fully 
competent photosynthetic tissues. Sucrose synthase catalyses the cleavage of 
sucrose in the presence of UDP into UDP-glucose and fructose. By cleavage of 
sucrose in the presence of UDP the high energy of the sucrose glycosidic link is 
conserved in UDP-glucose. which can then serve as a glycosidic donor for further 
reactions. Invertase. on the other hand, catalyses the hydrolysis of sucrose into 
glucose and fructose, and probably serves less to provide carbohydrate for storage, 
but may furnish the immediate energy requirements of development. 

35 In a preferred aspect, the present invention relates to the use of the promoter of the 
present invention to cause expression of an NOI that can affect the in situ activity of 
UDP galactose 4-epimerase. 
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GUS REPORTER GENE 

In some of the studies of the present invention, the E.coli uidA gene (otherwise 
5 known as the GUS gene) is used as a reporter gene. Here, the E.coli uidA gene is 
used as proof that the promoter of the present invention can cause expression of an 
NOI. However, it is to be noted that the present invention is not limited to the NOI 
just being the E.coli uidA gene. 

10 In more detail, the uidA gene codes for the p-glucuronidase enzyme (GUS) which is 
a hydrolase that catalyses the cleavage of p-glucuronides, e.g. X-gluc and MUG. 
The GUS reporter gene system for higher plants was developed by Jefferson 1987. 

The system utilises that the uidA gene can be fused to a promoter and introduced in 
15 to plant cells for expression. By assaying the activity of the GUS protein in 
transgenic tissue, the activity of a given promoter can be monitored. In some of our 
studies, the E.coli uidA gene has been modified to prevent glycosylation in plants, 
which enables GUS to retain activity when targeted to the endoplasmatic reticutum 
(ER). 

20 

For transient expression studies, X-GIuc (5-bromo-4-chloro-3-indolyl b-D- 
glucuronide) is used as a substrate in histochemical detection of GUS (qualitative), 
and MUG (4-methyl umbelliferyl glucuronide) as a substrate for fluorometrical 
detection (quantitative). The enzymatic cleavage of the colourless X-GIuc substrate 
25 produces a blue indigo precipitate at the site of cleavage. The precipitation is caused 
by oxidative dimerisation of the indoxyl derivative produced by the cleavage of X- 
Gluc. The activity of GUS can therefore be detected and analysed in transgenic 
tissue without extracting the enzyme before running the assay. 

30 PRODUCTION OF POI 

Host organisms comprising the vectors of the present invention may be used to 
express the POI by use of the promoter of the present invention. In this respect, host 
organisms may be cultured under suitable conditions which allow expression of the 
35 POI. In some instances, expression of the POI may be constitutive such that they 
are continually produced, or inducible, requiring an inducer to initiate expression. In 
the case of inducible expression, protein production can be initiated when required 



SUBSTITUTE SHEET (RULE 26) 



Best Available Copy 



WO 00/78975 

PCT/GBOO/02641 

53 

by, for example, addition of an inducer substance to the culture medium. 

Once the vector has been transformed or transfected into a suitable host organism 
then the host organism can be cultivated. Here reference can be made briefly to US- 
5 A-5543322 which says that for cultivation of a transformant. a culture medium 
containing carbon and nitrogen sources assimilable by the transformant and the like 
can be used. Any carbon source assimilable by the transformant can be used. 
Examples thereof include glucose, sucrose, starch, soluble starch, dextrin, glycerin, 
n-paraffin and the like as well as organic acids (e.g.. acetic acid, fumaric acid.' 
benzoic acid, etc.), alcohols (e.g.. methanol, ethanol. butanol. etc.), fats and oils 
(soybean oil. lard, etc.) and the like. They can be used alone or in combination 
thereof. As the nitrogen sources, there are. for example, peptone, soybean flour, 
cotton seed flour, meat extract, yeast extract, dried yeast, corn steep liquor, com 
gluten meal, urea, ammonium salts (e.g.. ammonium chloride, ammonium sulfate. 
eta)l _ nitrates J®?;- P° ta ssium nitrate, ammonium nitrate, etc.), other organic or 
inorganic nitrogen-containing materials and the like. They can be used alone or in 
combination thereof. In addition, inorganic salts (e.g., phosphates, etc.). trace metal 
salts (e.g.. magnesium salt, calcium salt, manganese salt, etc.) can be appropriately 



10 



15 



20 



35 



added. 



If desired, the POI can be extracted from host organisms by a variety of techniques 
known in the art. including enzymatic, chemical and/or osmotic lysis and physical 
disruption. For some applications, a preferred extraction/purification protocol may 
involve a centriguation step followed by. if necessary using, column chromatography 
25 such as ion-exchange or affinity chromatography. 

Thus, after the desired product (e.g. the POI) has accumulated in a culture medium, 
a supernatant fluid containing the POI can be obtained by centrifugation or filtration. 
On the other hand, when the POI has accumulated in the organisms, after the 
30 cultivation, the organisms can be collected by a known method and the desired 
product is recovered by an appropriate method. For example, the organisms can be 
suspended in a buffer containing a protein denaturant such as guanidine 
hydrochloride, the suspension is stirred in a cold place, and then the supernatant 
fluid containing the desired product is obtained by centrifugation or the like. 



Alternatively, after the organisms have been suspended in a buffer, the organisms 
can be ground by glass beads, or broken by French press, sonication, enzymatic 
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treatment or the like, and then the supernatant fluid is obtained by centrifugation or 
the like. 

For separation and purification of the POI from the above supernatant fluid reference 
5 can be made to US-A-554332 where it is stated that per se known separation and 
purification methods can be appropriately combined. As the known separation and 
purification methods, there are, for example, a method utilizing a difference in 
solubilities (e.g., salting out, precipitation with a solvent, etc.), a method mainly 
utilizing a difference in molecular weights (e.g., dialysis, ultrafiltration, gel filtration, 
10 etc.). a method utilizing a difference in charges (e.g., ion exchange chromatography, 
etc.), a method utilizing specific affinity (e.g., affinity chromatography, etc.), a method 
utilizing a difference in hydrophobicities (e.g., reverse phase high performance liquid 
chromatography, etc.), a method utilizing a difference in isoelectric points (e.g., 
isoelectric focusing, etc.) and the like. 



15 



SECRETION 



In some cases, it is desirable for the POI to be secreted from the expression host 
into the culture medium from where the POI may be more easily recovered. 
20 According to the present invention, the secretion leader sequence may be selected 
on the basis of the desired expression host. Hybrid signal sequences may also be 
used with the context of the present invention. 

Typical examples of heterologous secretion leader sequences are those originating 
25 from the fungal amyloglucosidase (AG) gene (g/aA - both 18 and 24 amino acid 
versions e.g. from Aspergillus), the a-factor gene (yeasts e.g. Saccharomyces and 
Kluyveromyces) or the a-amylase gene {Bacillus). 

DETECTION 

30 

A variety of protocols for detecting and measuring the expression of the POI are 
known in the art. Examples include enzyme-linked immunosorbent assay (ELISA), 
radioimmunoassay (RIA) and fluorescent activated cell sorting (FACS). A two-site, 
monoclonal-based immunoassay utilizing monoclonal antibodies reactive to two non- 
35 interfering epitopes on the POI may be used or a competitive binding assay may be 
employed. These and other assays are described, among other places, in Hampton 
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R et al (1990, Serological Methods, A Laboratory Manual, APS Press. St Paul MN) 
and Maddox DE et al (1983, J Exp Med 15 8:121 1). 

A wide variety of labels and conjugation techniques are known by those skilled in the 
5 art and can be used in various nucleic and amino acid assays. Means for producing 
labelled hybridization or PCR probes for detecting POI include oligolabelling, nick 
translation, end-labelling or PCR amplification using a labelled nucleotide. 
Alternatively, the NOI. or any portion of it. may be cloned into a vector for the 
production of an mRNA probe. Such vectors are known in the art, are commercially 
available, and may be used to synthesize RNA probes in vitro by addition of an 
appropriate RNA polymerase such as T7, T3 or SP6 and labeled nucleotides. 



10 



A number of companies such as Pharmacia Biotech (Piscataway, NJ). Promega 

(Madison, Wl), and US Biochemical Corp (Cleveland. OH) supply commercial kits 
15 and protocols for these procedures. Suitable reporter molecules or labels include 

those radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic 

agents as well as substrates, cofactors, inhibitors, magnetic particles and the like. 

Patents teaching the use of such labels include US-A-3,817,837; US-A-3,850,752; 

US-A-3.939,350; US-A-3,996.345; US-A-4,277,437; US-A-4,275.149 and US-A- 
20 4,366,241. Also, recombinant immunoglobulins may be produced as shown in US-A- 

4,816,567. 

Additional methods to quantitate the expression of a POI include radiolabeling (Melby 
PC et al 1993 J Immunol Methods 159:235-44) or biotinylating (Duplaa C et al 1993 
25 Anal Biochem 229-36) nucleotides, coamplification of a control nucleic acid, and 
standard curves onto which the experimental results are interpolated. Quantitation of 
multiple samples may be speeded up by running the assay in an ELISA format where 
the oligomer of interest is presented in various dilutions and a spectrophotometric or 
calorimetric response gives rapid quantitation. 

30 

Although the presence/absence of marker gene expression suggests that the gene 
of interest is also present, its presence and expression should be confirmed. For 
example, if the NOI is inserted within a marker gene sequence, recombinant cells 
containing NOIs can be identified by the absence of marker gene function. 
35 Alternatively, a marker gene can be placed in tandem with a NOI under the control of 
the promoter of the present invention or an alternative promoter (preferably the same 
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promoter of the present invention). Expression of the marker gene in response to 
induction or selection usually indicates expression of the POI as well. 



Alternatively, host cells which contain the NOI may be identified by a variety of 
5 procedures known to those of skill in the art. These procedures include, but are not 
limited to, DNA-DNA or DNA-RNA hybridization and protein bioassay or 
immunoassay techniques which include membrane-based, solution-based, or chip- 
based technologies for the detection and/or quantification of the nucleic acid or 
protein. 

10 

SUMMARY 



In summation, the present invention relates to a promoter and, also to a construct 
comprising the same. In particular the present invention relates to the use of a 
15 promoter for the expression of a NOI in an organism. 

In a preferred aspect, the present invention relates to the modification of carbohydrate 
metabolism by the transformation (genetic manipulation) of a plant by use of the novel 
promoter of the present invention to cause expression of an NOI that affects 
20 carbohydrate metabolism. . — . 

In one preferred aspect, the present invention relates to the modification of 
galactomannan synthesis by the transformation (genetic manipulation) of the 
leguminous plant Cyamopsis tetragonoloba (guar) by use of the novel promoter of the 
25 present invention to cause expression of an NOI that affects the galactomannan 
synthesis. 
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DEPOSITS 

The following sample was deposited in accordance with the Budapest Treaty at the 
recognised depositary The National Collections of Industrial and Marine Bacteria 
5 Limited (NCIMB) at 23 St. Machar Drive, Aberdeen, Scotland, United Kingdom. AB2 
1RY on 15 March 1999: 



Microorganism strain Number 

E - co,i TOP 10 (Invitrogen) 

+ pTBR-ScaK3 



NCIMB Number 

NCIMB 41011 



NCIMB 41011 comprises the novel nucleotide sequence of the present 



invention. 



10 



15 



20 



._. A re ^^ n map ^ ^ e nmelnuaeoti<ie sec ' uence ° f the present invention is shown in 
Figure 3. 

The present invention also encompasses sequences derivable from those deposits and 
embodiments comprising the same. The present invention also encompasses partial 
sequences derivable from those deposits and embodiments comprising the same, 
wherein those partial sequences code for regulatory elements. 

The promoter of the present invention may be isolated from the deposit by use of any 
of the suitable restriction enzymes indicated in Figure 2. Alternatively, one could use 
PCR techniques to "PCR out" the promoter sequence. Here, suitable primers would be 
based on the sequence presented as SEQ ID No. 1. 

By way of example, for amplification of the full-length RSus3 promoter (2700 bp) from 
pTBR-ScaK3. the primer-pair "M13 forward" and "RSusNcol" can be used. The upper 
M13 forward primer anneals to the pCR2.1-TOPO part of the clone some 110 bp 
from the cloning site, and the lower RSusNcol primer anneals to the translation start 
codon of RSus3. Alternatively the primer-pair "AP2" and "RSusNcol" can be used to 
isolate a similar fragment (AP2 anneals to the adaptor part of the cloned fragment). 

For amplification of the RSus3 promoter without intron 1 from pTBR-ScaK3, the 
primer-pair "AP2" and "LowexonVS" can be used. The lower LowexonY, primer is 
designed as follows: The 3' end of the primer corresponds to a 24 bp sequence just 
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upstream of the 5' splicing site in intron 1. The 5' end of the primer corresponds to 
the 27 bp sequence between the 3* splicing site and the ATG-codon, and 
incorporates of a Ncol site. 

5 Hence, in summation, the present invention relates to a promoter that is useful for 
causing selective expression of an NOI in the endosperm tissue/cells of a transformed 
plant. The NOI and/or the promoter may be heterologous to the transformed plant - in 
the sense that the NOI and/or the promoter may not naturally occur in the non- 
transformed plant. The promoter of the present invention is defined by having a 

10 nucleotide sequence corresponding to that shown as SEQ ID No. 1 or a variant, 
homologue, fragment or derivative thereof. 



INTRODUCTION TO THE EXAMPLES SECTION AND THE FIGURES 

15 The present invention will now be described, by way of example only, with reference 
to the accompanying drawings in which:- 



Figure 1 is a schematic diagram 
Figure 2 is a schematic diagram 

20 Figure 3 is a schematic diagram 
Figure 4 is a photograph 
Figure 5 is a schematic diagram 
Figure 6 is a schematic diagram 
Figure 7 is a schematic diagram 

25 Figure 8 is a schematic diagram 
Figure 9 is a schematic diagram 
Figure 10 is a schematic diagram 
Figure 11 is a graph 
Figure 12 is a schematic diagram 

30 Figure 13 is a schematic diagram 
Figure 14 is a schematic diagram 
Figure 15 is a photograph 
Figure 16 is a series of photographs 
Figure 17 is a graph 

35 Figure 18 is a graph 

Figure 19 is a schematic diagram 
Figure 20 is a schematic diagram 
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Figure 21 is a schematic diagram 
Figure 22 is a schematic diagram 
Figure 23 is a schematic diagram 
Figure 24 is a schematic diagram 
5 Figure 25 is a schematic diagram 
Figure 26 is a graph. 

Part I - Figures In more detail 

10 Figure 1 : A schematic representation (not to scale) of the promoter of the present 
invention and some of its associated sequences. 

Figure 2: Map of the RSus3 sequence in P ScaK3 ( P TBR-ScaK3) with the surrounding 
restriction sites. The other clones are analogues to this, but some have the opposite 
1 5 orientation. 

Figure 3 is a restriction map of the promoter of the present invention. 

Figure 4: Agarose gel showing the result of the semi-nested PCR. Lane 1 ana :2 
20 marker II; lane 2-6 EcoRV. Dral, Pvull, Seal and Sspl with buffer F: lane 7-11 same 
with buffer H. 

Figure 5. The promoter region from P ScaK3 and P SspK3 were amplified from the 
primers M13 forward and RSusNco. The amplified products contain Xhol and Ncol 
25 sites for directional cloning into the unique Sail and Ncol in pGUSNOSt as Xhol/Ncol 
fragments. 

Figure 6 is a schematic diagram of a construct. 
30 Figure 7 is a schematic diagram of a construct. 
Figure 8 is a schematic diagram of a construct. 
Figure 9 is a schematic diagram of a construct, 

35 

Figure 10 is a schematic diagram of a construct. 
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Figure 1 1 shows a time course of sucrose synthase activity during development of 
guar endosperm. The specific activity of sucrose synthase were measured in 
extracts from guar endosperm at various developmental stages from 7-41 days after 
5 flowering. The value of each point is the mean sucrose synthase activities obtained 
from 3 to 4 independent measurements in which at least 5 endosperms from each 
pod were pooled. 

Figure 12: Presentation of sequencing plan for pTBR-ScaK3. The arrows represent 
10 sequencing primers, and the numbers indicates approximate position of 5*end of the 
sequencing primers relative to the RSus3 ATG codon. 

Figure 13: Schematic presentation of various RSus3/GUS/NOSt constructs. The 
numbers indicates the base pairs relative to the translational start codon. 

15 

Figure 14: Schematic presishtatioh of various tandem Rsus3 constructs. The 
numbers indicates the base pairs relative to the translational start codon. 

Figure 15: Agarose gel showing the result of control digest of tandem repeat clones. 
20 Lane 1: Hindlll digest of pTandem leg/pro, which gives a specific fragment of 740 bp. 
Lane 2: Nhel digest of pTandem legumin, which gives a specific fragment of 570 bp. 
Lane 3: DNA marker VI. Lane 4: Hindlll digest of pTandem prote.min, which gis a 
specific fragment of 160 bp. 

25 Figure 16a is a photograph of cotyledon tissue of guar that has been ballistically 
transformed with the ENOS control plasmid that contains the GUS expression 
nucleotide sequence. This is a control study. As can be seen, high levels of 
transient expression of GUS are observed. 

30 Figure 16b is a photograph of endosperm tissue of guar that has been ballistically 
transformed with the ENOS control plasmid that contains the GUS expression 
nucleotide sequence. This is a control study. As can be seen, transient expression 
of GUS is observed. . 

35 Figure 16c is a photograph of cotyledon tissue of guar that has been ballistically 
transformed with the RSus3 construct plasmid p1730 (which contains a single copy 
of the nucleotide sequence of the present invention) that contains the GUS 
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expression nucleotide sequence. As can be seen, very low levels of transient 
expression of GUS are observed. 

Figure 16d is a photograph of endosperm tissue of guar that has been ballistically 
5 transformed with the RSus3 construct plasmid p1730 (which contains a single copy 
of the nucleotide sequence of the present invention) that contains the GUS 
expression nucleotide sequence. As can be seen, good levels of transient 
expression of GUS are observed. 



10 



15 



25 



35 



Figure 16e is a photograph of cotyledon tissue of guar that has been ballistically 
transformed with the RSus3 construct plasmid pTandem leg/pro (which contains a 
tandem copy of the nucleotide sequence of the present invention) that contains the 
GUS expression nucleotide sequence. As can be seen, very low levels of transient 
expression of GUS are observed. 



Figure 16f is a photograph of endosperm tissue of guar that has been ballistically 
transformed with the RSus3 construct plasmid pTandem leg/pro (which contains a 
tandem copy of the nucleotide sequence of the present invention) that contains the 
GUS expression nucleotide sequence. As can be seen, high levels of transient 
20 expression of GUS are observed. 

Figure 17. Histogram showing the transient GUS expression in guar endosperms 
after bombardment with the RSus3 constructs. The data are presented as mean 
number of blue spots per endosperm together with the calculated standard error. 



Figure 18. Histogram showing the transient GUS expression in guar tissue after 
correction for tissue size. The data are presented as mean number of blue spots per 
cotyledon together with the calculated standard error. 



30 MATERIALS AND METHODS 
Isolation of DNA 



The plasmids used in this work were isolated using Qiaprep Spin Miniprep Kit and 
Plasmid Maxi Kit from Qiagen (Hilden. Germany). These kits give very clean plasmid 
preparations which are ready for use in cloning, sequencing, and transient 
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expression assays. Further purification are therefore not necessary. 10 mM Tris-CI, 
pH 8.5 was used as elution buffer and storage buffer. All DNA was stored at -20°C. 



The concentration of plasmid preparations was determined, by measuring the 
5 absorbance at 260 nm in a Powerwave™ 200 spectrophotometer. An OD 26 o of 1 
corresponds to about SO^g/mL DNA. The amounts of DNA in isolated fragments 
was estimated by comparative fluorescence of Ethidium-bromide (Etbr) stained DNA 
in agarose gels. 



10 Restriction enzymes 



Enzyme digestion was performed using restriction enzymes from NEB and 
Boehringer M. The buffer provided by the supplier was used in the reactions. When 
double digestions were performed, a buffer compatible with both enzymes was 
1 5 chosen_(e.g. ... d igestiqns with both Nhel and Ncol was in NE Buffer 2 with 

BSA). 



Electrophoresis 

20 Electrophoresis was used to separate and identify DNA. The DNA was stained with 
the fluorescent dye Ethidium bromide (EtBr) incorporated in the gel in a 
concentration of 0.6 jag/mL Appropriate amount of DNA was mixed with loading 
buffer (0.1 % bromophenol blue, 16% Ficoll 400) loaded on to the either Nusieve or 
Seakem agarose gels (FMC Byproducts). TBE Buffer (0.1 M Tris, 0.09 M Boric acid, 

25 0.001 M EDTA) was used as elecrophoresis buffer. DNA were separated by 
electrophoresis in a 0.5 - 2 % (w/v) agarose gel, dependent upon the sizes of the 
bands of interest. For bands <1000 bp 2 % Nusieve GTG agarose gels, and for 
bands >1000 bp 0.5-1.5 % Seakem LE agarose gels were used. 

30 DNA molecular weight marker I, II. IV, VI and VII (Boehringer M) were used for 
evaluation of electrophoreses DNA. The markers were provided at a concentration of 
0.25 ng/^L and diluted as follows: 2 ^iL DNA marker, 2 *xL loading buffer and 8 ^iL 
H 2 0. Either 4 jaL or 10 jiL of this mixture was loaded on to gels depending upon the 
well size. 

35 
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Selected bands were isolated from the gel. by minimal UV radiation, using Qiaquick 
Gel Extraction (Qiagen). 

' Ligation 

5 

All ligations were performed using T4 DNA ligase in the reaction buffer (66 mM Tris- 
HCI. pH 7.6, 6.6 MgCI 2 , 10 mM dithiothreitol. 66 nM ATP) supplied by the 
manufacturer (Amersham Life Science). Fragment with Cohesive ends were ligated 
using 1 U T4 ligase. whereas blunt-ended fragment were ligated using 5 U of the 
to enzyme. 

To prevent intramolecular religation of open plasmid. it was treated with alkaline 
phosphatase from the psychrophilic shrimp Pandalus borealis (Boehringer M). This 
enzyme catalyses the dephosphorylation of 5' phosphates from the plasmid DNA, but 
is easily prevented from interfering with the subsequent ligation by heating to 60°C 
for 15 minutes. 

Transformation 



15 



20 



25 



35 



All the plasmids were used to transform, and propagated in, supercompetent TOP10 
One Shot™ Cells (Invitrogen) Genotype: F mcrA A(mrr-hsdRMS-mcrBC) 
d)80/acZAM15 A/acX74 cteoR recA1 araD139 A(ara-/eu)7697 ga/U ga/K rpsL endA1 
nupG. 

50 M L of supercompetent cells were carefully mixed with 1-5 nL of ligation mixture, 
and incubated on ice for 30 minutes, after which a 30 second heat shock was 
performed, followed by a 2 minute recovery period on ice. Finally, 250jiL of SOC 
medium were added, and the cells were incubated in an orbital shaker at 37C for 30 
minutes, before spreading on to LB plates containing 100ng/ml ampicillin. 

Polymerase Chain Reaction (PCR) 

Polymerase chain reaction (PCR) is a technique which is based on in vitro 
amplification of a specific DNA sequence using synthetic primers flanking the 
sequence, and a thermostable DNA polymerase (e.g. Taq DNA Polymerase) for 
replication of the DNA. 
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A wide range of PCR techniques were extensively used in this study, e.g. Hot Start, 
Touch Down, Long PCR, Nested PCR and Sequencing PCR. 

5 For optimization of PCR reactions the PCR optimizer™ Kit (Invitrogen) was applied. 
This kit comprises 16 buffers of four different pH values, each supplied with a range 
of four different magnesium ion concentrations, and eases optimization of PCR 
reactions. 

io The following polymerases were used: 

• AmpliTaq Gold™ (Perkin-Eimer) 

• Expand™ High Fidelity PCR system (Boehringer M) 

• Taq DNA polymerase (Pharmacia Biotech) 

15 



Expand™ High Fidelity PCR system is a polymerase mix of the Taq and Pwo 
polymerases, and is designed for amplification of PCR products up to 12 kb with high 
specificity. Taq posses 5'->3' transferase activity and generates ends with single 3* A 
overhang, whereas Pwo posses 3'->5' exonuclease activity and generate products 
20 with blunt ends. 

With the exception of cycle sequencing, all PCR reactions reported here used the 
Hot Start technique, which minimizes binding of primers to poorly matched sites on 
the template at low temperatures, and the subsequent amplification of the spurious 
25 extension products formed in the first cycle of the reaction. This is achieved by 
preventing polymerisation before a substantial period of denaturation of the template 
has elapsed, and this was effected in two different ways depending on the type of 
polymerase. 

30 When the Expand™ or AmpliTaq Gold™ DNA polymerase were used, a wax pellet 
was placed over the reaction mixture containing all components except for 
polymerase. The wax was melted by 95°C for 2 min, followed by cooling to 4°C, 
which results in its hardening to form a thin surface on top of the reaction mixture. On 
to this surface 1jaL of the polymerase was added and the tube was then placed in the 

35 thermocycler, which was preheated to 95°C beforehand, and the PCR programme 
was started immediately. 
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AmpliTaq Gold™ is a thermally activated polymerase mixture which allows a hot 
start, without the inconvenience of overlaying polymerase on to a solid wax surface. 
The enzyme is provided with an inactivating antibody bound to it. The polymerase is 
activated by a preliminary heating step, whereby the antibodies denatures, thus 
rendering the enzyme active. This allows a hot start PCR without the inconvenience 
of overlaying polymerase on to a solid wax surface, and the denaturation time is 
significant longer, and therefore more thorough. In these reactions all the 
components were added together with wax pellet, and the reaction started as 
described above. 



When amplification of long PCR products was desired, a time increment was added 
to the extension step in the PCR program. 

15 The PCR reactions were performed in a Mastercycler (Eppendorf). 
A typical example of such a reaction is summarised in the table below. 



20 



PCR reaction 


PCR program 


Reagent 


nL/reac. 


Step 


Temp (°C) 


Time 


Template (1:100) 


1 


1 


94 


2" 


5*Buffer 


10 


2 


94 


30" 


dNTP mix 


4 


3 


68 


30" 


Upper primer 4 jaM 


5 


4 


go to 2, 29 times 


Lower primer 4 jiM 


5 


5 


68 


10' 


Water 


to 50 


6 


4 


infinite 


Expand 

^. ^ i 1 1 i . 


0.75 









Cloning of PCR products 



25 



Under most circumstances, isolated PCR products were cloned in pCR2.1-TOPO 
(Invitrogen) which was provided as a linearized fragment with single 3'-T overhangs 
and TOPO isomerase. This system exploits the tendency of Taq polymerase to add a 
single overhanging 3 -A in both ends of the amplified product The TOPO isomerase 
has both cleavage and ligation activity, it cleaves the vector at a CCTTT site thereby 
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generating a single 3'-T overhang, which will religate with a PCR product with single 
5'-A overhangs. This system therefore allows cloning of Taq amplified PCR products 
without the use of ligation with T4 ligase. 

5 The cloning site of pCR2.1-TOPO resides in a disruptive locus of the lacZ gene, 
which allows clones with inserts to be selected by blue/white screening in the 
presence of the chromogenic substrate for (J-galactosidase (x-gal). Miniprep DNA 
from white colonies was analysed by restriction digests, PCR and/or sequencing^ 

10 Some PCR-fragments were digested with enzymes and cloned directly into plasmid 
vectors without a sub-cloning step. When cohesive ends could be generated by 
restriction enzymes in this way, and exploited for direct cloning, substantial savings 
in time were achieved. 

15 Primers 

The relevant primers are shown in the Tables presented below. 
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Primers and oligos 

Listed in alphabetic order (Storage name, size) 

AP1: (Walking AP1a, 27-mer) 

5'- GGA TCC TAA TAC GAC TCA CTA TAG GGC 

AP2: (Walking AP2, 17-mer) 

5*- AAT AGG GCT CGA GCG GC 

Dellowl: (Dellow 1, 22-mer) 

5' GCT TTC CAC CAC AAA ATG ACA C 

E356: (E356, 26-mer) 

5'- GGA ATT CTA GTA ACA TAG ATG ACA CC 
E357: (E357. 28-mer) 

5'- GGAATTCCC CGA TCG TTC AAA CAT TTG G 

HindGCN4: (HindGCN4, 24-mer) 

5' GGA AGC TTG CGA AAA TGT GCA GGG 

Lowexon 1 /*: (Low exon 1/2, 56-mer) 



NhedeH: (Nhedel 1, 23-mer) 

5' CC G CTA GCA CAG AGG CTG AGC AG 

Nhedel2: (Nhedel.2, 27-mer) 

5' TGC TAG CTG GTA AAT GAC ATG CTG CTG 

Nhedel3: (Nhedel.3, 22-mer) 

5' C GC TAG C AG AGG CAG CAA GCT C 

NheGCN4: (Nhe GCN4, 22-mer) 

5' GG G CTA GC G AAA ATG TGC AGG G 

Oligo 1: (Walking Adaptor, 44-mer): 

5' CTA ATA CGA CTC ACT ATA GGG CTC GAG CGG CCG CCC GGG CAG GT 

Oligo 2: (Walking AP1b, 8-mer) 
5'phosphate- AC CTG CCC - 3'amine 

pGUS lower: (pGUS lower. 22-mer) 

5' CTG GCG AAA GGG GGA TGT GCT G 

Rsus3: (Lower RSus 3 Spe. 25-mer) 

5" ACG ACG GAA TGG ATA ATA GCA GAT A 




pT ATC TTC TAG TTG GAT CCT CAA GCC TTG CAC TGA AGG GGA 

: (M13 forward. 16-mer, TOPO Cloning Kit (Invitrogen)) 
aCT GCC GGT C 
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RsusNco: (RsusNcol, 21-rner) 

5' GTT TCC C CCATG G CT ATC TTC 

RsusTATA: (RSusTATA, 21-mer) 

5* CCT CCC TG A AGC TT T TCG TGT 

UppCR2.1: (UppCR2.1, 44-mer) 

5* ATT AGG CAC CCC AGG CTT TAC ACT TTA TGC TTC CGG CTC GTA TG 



Lower sequencing primers: 
(Listed in numerical order) 

RSusseq 1 

5' Cy5- GT TTC CCC CAT TGC TAT CTT C 
RSusseq 2 

5* Cy5- AG TGC CAG GTT CAA GGA CA 
RSusseq 700 

5* Cy5- AC CAA TCC CAG AAA CCC AAG C 
RSusseq 1000 

5' Cy5- GT GTC CCC TGC CTC ACT CC 
RSusseq 3 

5' Cy5- CC GGC TAA GTT AAA AAA AAA 
RSusseq 4 

5* Cy5- CT GTG CCG TTG GAA GCG TCA T 
RSusseq 5 

5' Cy5- CG CAG ATG GGT TCA GCC TTC A 
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Upper sequencing primers: 
(Listed in numerical order) 

Scaseq 1 

5' Cy5- GG TCG GCA CAT TGA GAG GTC 
Scaseq 2 

5' Cy5- CA CAC CCA ACG CTC ACC GAT G 
Scaseq 3 

5' Cy5- AG GAC GGT TTT GGT TGG GAT T 
Scaseq 4 

5* Cy5- TC CTC CTC TTC CCC CTT CAG TG 
Scaseq 5 

5' Cy5- AT CTG GCA ACC TTT TGT TTC T 



10 



M13 sequencing primers; 
M1 3 Reverse 

5* Cy5- CA GGA AAC AGC TAT GAC 
M13 Universal 

5* Cy5- CG ACG TTG TAA AAC GAC GGC CAG T 



Sequencing 

The Rsus 3 promoter region was sequenced with a ALFexpress DNA sequencer. 
The ALFexpress is designed for automated detection of fluorescently labelled DNA 
molecules separated by electrophoresis. 

Cy5-labeiled fluorescent M13 reverse and forward primers for the pCR2.1-TOPO 
vector was used for sequencing of 5' and 3' ends of cloned PCR products. In addition 
Cy5-labelled sequencing primers was designed for specific regions of the Rsus3 
promoter region, using the described OLIGO™ program - Version 5.0 for Windows 
(National Biosciences Inc, Plymouth, Ma). 



15 
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Materials for sequencing 

• ALFexpress™ DNA Sequencer (Pharmacia biotech AB) 

• Thermo Sequenase fluorescent labelled primer cycle sequencing kit with 7- 
5 deaza-dGTP (Amersham Pharmacia biotech) 

• ReproGel™ Long Read, for polyacrylamide gel electrophoresis with the ALF® 
family of instruments(Amarsham Pharmacia Biotech) 

• ReproSet™, for UV polymerisation of Reprogel™ (Amersham Pharmacia 
Biotech) 

10 • ALFwin™ Sequence Analyser 2.00, Windows 95 based program that controls 
ALFexpress (Amersham Pharmacia Biotech) 

Diodeoxy chain terminator sequencing was performed in a thermocycled reaction 
using the special mutant polymerase ThermoSequence supplied by Amersham 
15 PharmacLa Biotech both to accept fluorescently labelled 

primers and to have equal affinity for the four diodeoxy chain terminators. Band 
compressions on the gel were limited by the use of 7-deaza-dGTP in the reaction. 

Sequence analysis 

20 

The resulting sequences was assembled using Winseq 1.01, developed by F.G. 
Hansen, Department of Microbiology, DTU, Denmark. After this preliminary 
sequence assembling, the sequence analysis was performed using the same 
program. 

25 

Isolation of the RSus 3 promoter region from rice genomic DNA 

The rice sucrose synthase 3 (RSus3) promoter region was isolated from rice 
genomic DNA using the chromosome walking technique described by Siebert et al. 
30 1995, which is an adaptor mediated PCR method, designed to amplify an unknown 
sequence which is flanking a known sequence. 

Genomic DNA is first digested with enzymes which produce blunt ended products 
(Dral, EcoRV, Pvull, Seal or Sspl) on to which adaptors are ligated. The result is the 
35 generation of 5 DNA-libraries, which can serve as templates for a PCR, using one 
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primer specific to the known sequence' Lid an adaptor-specific primer which is 
described in Siebert et al (ibid). 

The adaptor sequence and the adaptor primer (AP1) and the nested adaptor primer 
5 (AP2) was provided. The adaptor was designed with an aminogroup in 3' end which 
prevents amplification from the AP1 primer binding site, unless there is an initial 
round of amplification from e.g. a gene-specific primer [After Siebert et al. 1995]. 

The adaptor incorporates two features which ensure that template with adaptor 
b.nd.ng sites at both ends will not be amplified. Firstly, due to an amino group in the 
3'end of the adaptor, the generation of an AP1 primer binding site by extension of the 
lower strand 3' end is prevented. Thus, the only circumstances in which an AP1 
primer binding site is formed during the PCR. are those in which there is an initial 
round of amplification from a primer which binds within the restriction fragment. 
Secondly, amplification of template, formed by unspecific priming from AP1. will be 
suppressed. Fragments formed in this manner contain inverted terminal repeats in 
the single stranded product, which will form a secondary stem-loop structure, which 
is more stable than the template-primer hybrid. The formation of the stem-loop 
structure prevents annealing of primer, which suppresses amplification of unspecific 
PCR products. The specificity of the methodjs further improved by reamplification of 
the product using a nested PCR primer " ~- 



Preparation of genomic DNA 

25 Geno mic DNA wa s obtained from rice. 1 M of this DNA was digested with 20 U of 
either of the following blunt end enzymes: Dral. EcoRV. Pvull. Seal and Sspl in a 
total volume of 50 uL. 

The digestions were run through an enzyme remover column (Amicon) and 
30 concentrated by alcohol precipitation followed by dissolving in 20 uL TE. 

The adaptor was prepared by mixing 800 pmol of each of Oligo 1 and Oligo 2 in a 
total volume of 22 \xL. The mix was denatured by treatment at 94°C for 1 min.. and 
transferred on to ice. 

35 
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An excess of the adaptor was ligated with the blunt-end digests using T4 DNA 
ligase. 4 yiL adaptor and 13 digest were mixed with 10 U T4 DNA ligase in a total 
volume of 30 >iL, and incubated at room temperature for 24 hours. 

5 Excess adaptors were removed after ligation with the Qiaquick PCR purification kit 
(Qiagen), which recovers >100 bp PCR products and thereby also will remove 
excess adaptor. 

PCR screening for the RSus3 promoter region 

10 

For isolation of the RSus3 promoter region, the following sequence-specific primer 
was designed: 

5'-ACG ACG GAA TGG ATA ATA GCA GAT A-3' 

The 3' end of this antisense primer anneals approx. 300 bp downstream of the ATG 
of RSus3. 

The amplification was performed in a Mastercycler (Eppendorf) as a hot started, two 
20 step. PCR (68°/94°C). To ensure optimal product length and limitation of errors the 
proof-reading thermostable DNA polymerase mixture Expand™ (Boehringer M) was 
used. 



For additional optimisation of the amplification, a PCR with one nested primer was 
25 performed. After the initial PCR (primer-pair AP1/Rsus3) f a secondary semi-nested 
amplification (primer-pair AP2/Rsus3) was performed, using the product from the 
initial amplification as template. 

The composition and reaction conditions for the initial PCR are summarised in the 
30 table below. 
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Initial PCR reaction 


PCR program 


Keanpnt 


jjL/reac. 


Step 


Temp (°C) 


Time 


Template (1:20) 


5 




94 


1* 45" 


5xBuffer (E, F, G 


10 


2 


94 


15" 


and H) 










dNTPmix(IOmM) 
AP1 (4 nM) 
Rsus3 (4 |iM) 


4 


3 


68 


4" 


5 


4 


go to step 2, 9 times 


5 


5 


94 


15" 


Water 


20 


6 i 


68 


4' + 


Expand™ 


0.75 






20'Vcycle 


7 


go to step 5, 19 times 






8 


I 


infinite 



_minutes:' and. .seconds^"- 



5 Buffer E, F. G, and H for the PCR optimized Kit (Invitrogen) were chosen for 
optimisation of the buffer conditions. 

The semi-nested PCR was performed as Outlined in the table below. 



Secondary PCR reaction 


PCR program 




Reagent 


fiLyreac. 


Step 


I Temp (°C) 


Time 


Template (1:100) 
SxBuffer (F and H) 
dNTP mix (10 
mM) 

AP2 (4 nM) 
Rsus3 (4 jjM) 
Water 


1 

10 
4 


1 

2 
3 


94 
94 
68 


1'45" 

15" 

A' 


5 


4 


go to step 2. 9 times 


5 

25 


5 
6 


94 
68 


15" j 
4' + 


Expand™ 


0.5 






20'Vcycl 
e 


7 


go to step 5, 1 9 times 






8 


I 


infinite 



10 
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compatible with Sail and. since no internal Xhol sites are present in the RSus3. this Xhol 
site represents the 5' end of all of the native RSus3 fragments. 

Sequencing revealed that the sequence around the ATG start codon in RSus3 was 
CAATGG. So a introduction of the Ncol (CCATGG) in the consensus sequence around 
the ATG only affects a single basepair. For generation of the 3' end of the promoter 
region, the RSus3Nco primer was therefore designed with a Ncol site positioned at the 
ATG start codon for translational fusion of amplified promoter fragments with the uidA 
gene in pGUSNOSt. 

For amplification of the two RSus3 fragments (1450 bp and 2700 bp ) the primer-pair M13 
forward/ RSusNco was used and the RSus3 fragments were amplified from P SspK3 and 
P ScaK3. The M13 primer anneals to the pCR2.1-TOPO part of the pSsp and pSca clones 
some 1 10 bp from the cloning site. 

The two RSus3 fragments were amplified according to the following scheme: 



20 



25 



PCR reaction 


PCR program 




Reagent 


yiL/reac. 


Step 


Temp 
(°C) 


Time 


Template (1:20) 


5 


1 


95 


2' 


5xBuffer (G and H) 


10 


2 


95 


V 


dNTP mix (10 mM) 


4 


3 


55 


V 


M1 3 forward (4 jiM) 


5 


4 


72 


2' 


RsusNcol (4 |xM) 


5 


5 


go to 2, 25 times 


Water 


20 


6 


72 


10' 


Expand™ 


0.5 




4 


0" 



"Qiagen miniprep of pSspK3 or pScaK3 

The resulting products were cloned in pCR2.1-TOPO and then subcloned into pGUSNOSt 
as Xhol/Ncol fragments. Positive clones was identified by restriction mapping of miniprep 
ONA. Although, the pGUSTNOSt was linearised with both Sail and Ncol. the vector was 
treated with phosphatase before ligation to prevent relation of plasmid that was cut with 
only one enzyme due to the close placement of the Sail and Ncol sites 
( d/T CGA C/C ATGGV 
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The NOS terminator was amplified from pDB2 as a 266 bp fragment with the 
E357/E356 primerpair, each of which contained an EcoRI site 5' to the annealing 
region. The amplified fragment was cloned in pCR2.1-TOPO, a positive clone was 
identified, and named pNOStO. This was sequenced in both directions with universal 
5 and reverse sequencing primers (pCR2.1-TOPO) and the resulting sequence was 
verified using the BLAST function in the Entre 2 search engine. NOSt was excised 
from pCR2.1-TOPO with EcoRI and cloned into the unique site for this enzyme in 
P GUSN358->S. The orientation of the fragment in pGUSN358-»S was tested by 
PCR using the primer pairs E3S7/pGUSIower for the correct orientation, and 
E356/pGUSIower for reverse orientation (product of 350 bp). A clone with NOSt in 
the correct orientation was selected, and named pGUSNOSt 2 (5253 bp), which was 
later just referred to as pGUSNOSt, whereas a clone with NOSt in reverse orientation 
was selected and named pGUSNOStl. 

15 Cloning of the RSus3 promoter in pGUSNOSt 



10 



The initial goal was to clone as much as possible of RSus3 promoter into 
pGUSNOSt, without any change of the promoter sequence. 

20 Restriction mapping of the RSus3 promoter localised the following features: 

1 • An inte mal Hindlll site approx. 1200 bp upstream the ATG codon. 
2. A Pstl site in clones originating from rice DNA template digested with Sea I (pSca 
clones), but not in those digested with Ssp I (pSsp clones). 
25 3. Several Sphl sites in both pSca and pSsp clones. 

4. No Sail and Ncol sites were found in any of the cloned fragments. 

These characteristics limited the number of unique restriction sites for cloning of the 
RSus3 promoter in pGUSNOSt. Only Sail and Ncol were available for cloning of the 
30 RSus3 promoter. 

The cloning strategy, was to generate two RSus3 upstream fragments (1450 bp and 
2700 bp) with ends compatible to the Sail and Ncol sites for directional and 
translational fusion of the RSus3 promoter with the uidA gene. The cloning strategy 
35 is outlined in figure 5. 
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The adaptor from the chromosome walking technique contains a Xho! site and 
therefore the isolated RSus3 promoters in the pSsp and pSca clones all contain this 
Xhol site in the 5' end of the promoter region. Overhanging ends generated from 
cutting with Xhol are compatible with Sal! and, since no internal Xhol sites are 
5 present in the RSus3, this Xhol site represents the 5' end of all of the native RSus3 
fragments. 

Sequencing revealed that the sequence around the ATG start codon in RSus3 was 
CAATGG. So a introduction of the Ncol (CCATGG) in the consensus sequence 
10 around the ATG only affects a single basepair. For generation of the 3' end of the 
promoter region, the RSus3Nco primer was therefore designed with a Ncol site 
positioned at the ATG start codon for translational fusion of amplified promoter 
fragments with the uidA gene in pGUSNOSt. 



15 For amplification of the two RSus3 fragments (1450 bp and 2700 bp ) the primer-pair 
M13 forward/ RSusNco was used and the RSus3 fragments were amplified from 
pSspK3 and pScaK3. The M13 primer anneals to the pCR2.1-TOPO part of the pSsp 
and pSca clones some 110 bp from the cloning site. 



20 The two RSus3 fragments were amplified according to the following scheme: 



PCR reaction 


PCR program 


Reagent 


jiL/reac. 


Step 


Temp 


Time 








(°C) 




Template (1:20) 


5 


1 ! 


95 


2' 


SxBuffer (G and H) 


10 


2 


95 


r 


dNTP mix (10 mM) 


4 


3 


55 


V 


M1 3 forward (4 


5 


4 


72 


2' 


RsusNcol (4 


5 


5 


go to 2, 25 times 


Water 


20 


6 


72 


10' 


Expand™ 


0.5 




4 


0' 



Qiagen miniprep of pSspK3 or pScaK3 



25 The resulting products were cloned in pCR2.1-TOPO and then subcloned into 
pGUSNOSt as Xhol/Ncol fragments. Positive clones was identified by restriction 
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mapping of miniprep DNA. Although the pGUSTNOSt was linearised with both Sail 
and Ncoi. the vector was treated with phosphatase before ligation to prevent 
relation of plasmid that was cut with only one enzyme due to the close placement of 
the Sail and Ncol sites (G/TCGA C/CATGG ). 

5 

A positive 1450 bp clone was named "p1450" and a positive 2700 bp clone was 
named "p2700". 



10 



15 



20 



25 



30 



Deletions of the RSus3 promoter 

In order to identify those parts of the cloned promoter regions which confer activity 
and specificity, we performed a molecular dissection of the cloned fragments, by 
effecting serial deletions of regions which were deemed from the DNA sequence to 
contain motifs, identified in the promoters of other genes. 



The generation of RSus3 promoter deletions involved PCR amplification of selected 
parts of the promoter region, and cloning of these fragments into a construction 
which was based on the p1450 construct. It was decided that all truncations of the 
promoter would be upstream of the TATA box. which is positioned 989 bp 5' to the 
translation start codon. and approximately 100bp upstream of intron 1. This strategy 
gave the option of retaining or later removing intron 1 from constructs, while 
focussing deletions upon the region, in which all the recognised promoter motifs are 
located. 

As described above, there are a limited number of restriction sites available in the 
pGUSNOSt MCS for cloning of truncated promoter inserts. In fact, there are no 
unique sites left in the pGUSNOSt part of the resulting clones. But in the adaptor part 
of the cloned 1450 bp and 2700 bp fragments there are a number of restriction sites. 

Only a few restriction sites are unique within the RSus3 part of the p2700 and the 
P1450 constructs. Of these, only BamHI, Bglll, BsiWI and Clal generate cohesive 
ends and are therefore convenient for cloning, but BamHI is located downstream of 
intron 1 and Bglll within it. The BsiWI and Clal on the other hand are located only 
300-400 bp from the MCS and therefore leave very little sequence that can be 
truncated. 
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So the lack of usable cloning sites led to the following strategy: 

The MCS Hindlll site was converted to a Nhel site by partial digestion followed by fill- 
in with Klenow and religation of the resulting blunt ended fragment. PCR amplified 
5 fragments with Nhel/Hindlll could afterwards be cloned directionally in the resulting 
unique Nhel and Hindlll sites. 

This strategy had several advantages, though it is quite laborious. 

10 1) A standard test plasmid was created, which contained unique Nhel and Hindlll 
sites, into which PCR amplified products could be directionally cloned. 

2) The sequence of the Hindlll/Ncol fragment obtainable from all truncated clones 
was identical, so possible variations owing to PCR-generated mutations in this region 
15 were avoided. 

Hindlll-+Nhel conversion for directional cloning of PCR deletions 

The conversion of the Hindlll site in the MCS to a Nhel site involved partial digestion 
20 with Hindlll, filling-in of the 5* single stranded overhang with Klenow fragment, and 
religation which formed an Nhel site. Reaction conditions for effecting a partial 
digestion with Hind III were established empirically, using the range of dilutions of 
enzyme in the tabel below and slowing the reaction by performing it at 25°C, rather 
than 37°C. 

25 



Reaction scheme for the partial digestion: 



sample 


Plasmid 
dilution 1 (nL) 


NEBufffer 2 
(HL) 


Hindlll 

dilution 1 

(HL) 


H 2 o ! 
(hU 


1 


2 


1.5 


2 


9.5 


2 


2 


1,5 


4 


7.5 


3 


2 


1.5 


6 


5.5 


4 


2 


1.5 


8 


3.5 


5 


2 


1.5 


10 


1.5 


6 


2 


1,5 


12 


0 



Dilutions: 0.25 fig/^L plasmid and 0.02 U/jiL Hindlll 
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The partial digests were incubated in a Mastercycler (Eppendorf) for 15 min at 25'C 
followed by 10 min at 65°C. The result was evaluated on a 1% Seakem agarose gel 
and sample 2 and 3 gave the best result. In these two samples there were 4 bands: 2 
5 bands from uncut plasmid. a band from a single Hindlll cut and a faint band from 
cutting of both Hindlll sites. The latter band increased in intensity in the subsequent 
samples, at the expense of the single cut band. 



10 



20 



25 



30 



The partially Hindi ll-digested 1450bp construct was filled in using DNA Polymerase I 
Large Fragment (Klenow) (NEB) before ligation. In the presence of dNTP's, Klenow 
fragment retains the polymerization fidelity of the E.coli DNA Polymerase I without 
degrading 5" termini. Treatment with Klenow fragment was performed as follows: 



14 pL partial digested plasmid 
1 5 2 pL 1 0 mM dNTP mix (Invitrogen) 
0.6 pL NEBuffer 2 (NEB) 
2.9 pL HjO 

0.5 pL Klenow fragment (NEB) 



The reaction was incubated for 15 min at 25«C and 10 min at 75°C. After the filling-in 
reaction the singly cut plasmid band was isolated from the gel. ligated and positive 
clones were isolated. This strategy gives rise to two different types of clones, one 
with an upstream Nhel site ( P 2700(upNhel) 1 and 3). and one with a downstream 
Nhel site ( P 2700(downNhel) 7 and 10). Although the latter type of clone was not 
suitable for promoter truncation studies, it found utility later during the construction of 
tandem repeats of promoter motifs. 

Additionally, the Hindlll site in the MCS of the parent plasmid pGUSNOSt. which 
contained no RSus3 sequence, was modified to an Nhel site in a similar manner, to 
form pGUSNOStNhel 2 and 3. but the partial digestion was of course not necessary 
for the unique Hindlll site in this plasmid. 
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Cloning of three RSus3 deletions in p2700(upNhel) 

See Figure 6. 

5 

Three deletions were generated using the three primer-pairs with different upper 
primers, but the same lower primer NhedeM/Dellowl, Nhedel2/Dellow1 and 
Nhedel3/Deliow1 . The Nhedel1-3 primers was designed with a Nhel site and the 
Dellowl primer was designed to anneal downstream of the internal Hindll! site. Thus 
10 these primer-pairs results in generation of 3 different PCR products (650 bp, 590 bp 
and 480 bp), with the 2700bp construct as template, each with Nhel/Hindlll sites for 
cloning in p2700(upNhel). 

The amplification was performed in a Mastercycler (Eppendorf) as a 2-step PCR 
15 (68/94°C) described in the table below. 



PCR reaction 


PCR program 


Reagent 


jiL/reac. 


Step 


Temp (°C) 


Time 


Template (1:100) 


1 


1 - 


95 


2* 


SxBuffer (E,F,G or 


10 


2 


94 


45" 


H) 










dNTP mix (10 


4 


3 


68 


45 M 


mM) 










Nhedell.7,3 (4 


5 


4 


go to 2, 25 times 


HM) 










Dellowl (4 ^M) 


5 


5 


68 


10' 


Water 


24,5 


6 


4 


infinite 


Expand™ 


0.5 









The PCR products were cloned in pCR2.1-TOPO and positive clones were identified 
20 by restriction mapping. Nhei/Hindlll fragments from these were isolated and cloned 
into p2700(upNhel) 1. 

Three positive clones were identified and named w p1730 n clone 3, "p1670" clone 3 
and tt p1560 M clone 3. 
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The p1160 construct 



See Figure 7. 

5 

This deletion was generated by cutting the P 2700 construct Hindlll. The vector was 
religated and two positive clones were named H p1 160" clone 1 and 2. 

The p2700-prolamin construct 

10 

See Figure 8. 

Whereas all the RSus3 deletions described above are 5* end deletions, this 
truncation comprises an internal deletion of 161 bp between the internal Hindlll site 

15 and the TATA-box in the RSus3 promoter. Based on the RSus3 sequence, a 
Primerpair (RsusTATA/RsusNcb) was designed for amplification of a 1020 bp 
fragment, with a Hindlll site at one end and an Ncol site at the other. This product 
therefore comprises Intron 1 flanked by the TATA box, and the ATG translational 
start codon. The primer RsusTATA contained an internal Hindlll site and annealed 

20 just upstream the TATA-box. The primer RsusNco, which contains a Ncol site 
positioned at the ATG start codon for translational fusion of amplified promoter 
fragments with the uidA gene, was the same as described above in section 5. 

The PCR reaction was performed as a 2-step PGR (68/94°C) after the following 
25 scheme in a Mastercycler (Eppendorf). 



PCR reaction 


PCR program 


Reagent 


HUreac. 


Step 


Temp (°C) 


Time 


Template (1:100) 


1 


1 


95 


2 f . 


5xBuffer (A and B) 


10 


2 


94 


45" 


dNTP mix(10mM) 


4 


3 


68 


45" 


RsusTATA (4 nM) 


5 


4 


go to 2, 30 times 


RsusNco (4 nM) 


5 


5 


68 


10' 


Water 


24,5 


6 


4 


infinite 


Expand™ 


0.5 
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The 1020 bp Hindlll/Ncol product was generated by PCR and cloned into pCR2.1- 
TOPO. Positive clones was identified, and the Hindlll/Ncol fragment was cloned into 
2700bp(upNhel) (linearized with Hindlll and Ncol for substitution of a corresponding 
1160 bp fragment). A positive clone was isolated and named M p2700-prolamin n clone 



Removal of Intron 1 within the RSus3 promoter region 



In order to create a promoter lacking Intronl , screening for the S'-untranslated region 
10 of an RSus3 clone in a rice cDNA library, and splicing of this to the upstream 
promoter motifs already cloned, were initially considered. However, sequence 
analysis revealed the 3* acceptor splice site of intron 1 to be located only 27 bp 
upstream of the ATG-codon. In light of this, a preferred strategy was formulated in 
which the sequence of this 27 bp downstream non-coding exon was incorporated in 
1 5 the lower strand PCR primer used to amplify the promoter. 



See Figure 9. 



A primer pair was designed for PCR amplification of-the RSus3 promoter region from 
20 pSca and pSsp clones. The lower primer (LowexonVi) was designed as follows: The 
3' end of the primer corresponded to a 24 bp sequence just upstream for the putative 
5' splicing site in intron 1. The 5' end of the primer corresponded to the 27 bp 
sequence between the 3' splicing site and the ATG-codon (Ncol). The upper primer 
UppCR2.1 was designed to anneal to a 44 bp sequence of pCR2.1-TOPO, upstream 
25 of the M13 reverse primer site, for amplification of the RSus3 promoter without intron 
1 from pSca and pSsp clones. 

Both primers were quite long (LowexonYa is 56 bp and UppCR2.1 is 44 bp), and to 
ensure that they were full length, they were ordered HPLC purified. LowexonVi has 
30 the possibility to form a hairpin loop with a melting temperature of 56°C and an 
internal BamHI site gives a stable primer dimer, so the PCR amplification was 
performed using AmpliTaq Gold an a two step program (68°C/94°C). 

Due to the potential of these long primers to form partially miss-matched double- 
35 stranded secondary structures at their 3' end, AmpliTaq gold was used instead of 
Expand™ polymerase, in order to avoid the modification of the oligonucleotides by 
the proof-reading Pwo polymerase. 
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The result was a very faint band of the right size, so a semi-nested PCR amplification 
with M13 reverse/RsusNcol was performed using purified product as template. 

5 This nested PCR gave a more intense band, which was cloned in pCR2.1-TOPO. A 
positive clone was identified and a Xhol/Ncol fragment was subcloned in pGUSNOSt 
(Analogous to subcloning of Xhol/Ncol fragments from pScaK3 and P Ssp3). A 
positive clone was named u p1450-intron" clone 2. 



io 



20 



25 



Generation of tandem-repeat RSus3 promoters 



In an attempt to enhance the activity of the RSus3 promoter, a range of sequences 
upstream the TATA-box in RSus3 were doubled. This strategy resulted in 3 tandem- 
repeat RSus3 promoters, in which different parts of the RSus3 promoter region had 
15 been doubled. 

See Figure 10. 

DNA sequence analysis located the majority of putative cis-elements to be within 700 
bp (-1709 bp to -1027 bp), upstream of the TATA box. Therefore these 700 bp 
contained the elements to be doubled. - — - 

The three tandem-repeat promoters were constructed by amplification of specific 
sequences of the promoter, cloning of these into pCR2.1-TOPO. and finally 
subcloning the cloned fragments in either the Nhel site or the Hindlll site in the 
p1 730 e onstruct. - 

Tandem legumin 

30 The repeated sequence in the tandem legumin promoter were amplified after same 
description as Nhel/Hindlll product in p1730. but with P 2700(downNhel) as template. 
This gave a similar product, only with two Nhel sites, which after cloning into pCR2.1- 
TOPO. was subcloned as an Nhe I fragment into the Nhel site of p1730. A positive 
clone was identified by Nhel digestion, and a the orientation was established by 

35 repeating the PCR using the subclone as template. The resulting clone was named 
"pTandem legumin" 
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pTandem leg/pro 

This tandem-repeat promoter was constructed in a similar approach. A primer pair 
was designed for amplification of a 750 bp fragment from the p2700 construct. The 

5 primer pair each had an Nhel site (Nhedell used for p1730 t and NheGCN4). 
Amplification of the primer pair Nhedel1/NheGCN4, was again performed, after same 
description as for the p1730 construct, and resulted in a 750 bp product (with Nhel 
sites in both ends), which were cloned in pCR2.1-TOPO, and afterwards subcloned 
into the Nhel site of the p1730 construct. A positive clone was identified by 

10 restriction digest, the orientation was determined in the same manner, and the clone 
was named "pTandem leg/pro". 

pTandem prolamin 

15 This tandem-repeat prom Mrnost the same manner as the 

tandem leg/pro. The primerpair had same upper primer, and an analogous lower 
primer with a Hindill site (HindGCN4). The resulting product was cloned into pCR2.1- 
TOPO and a 170 bp Hindlll/Hindlll fragment from this was subcloned in the Hindill 
site in the p1730 construct. A positive clone was identified by restriction digest, the 

20 orientation of which was determined by repeating the PCR using the subclone as 
template The resulting clone was named "pTandem prolamin". 

Transient ballistic transformation with the Particle Inflow Gun (PIG) 

25 The Particle Inflow Gun (PIG) used in this study was in accordance with Finer et al. 
1992 and Vain et al. 1993. The PIG - which is placed on a sterile-bench for 
minimizing contaminants - comprises a vacuum chamber with a digital vacuum 
sensor, an electronically operated solenoid gas valve attached to a helium cylinder 
regulator (1-16 bar), and a three-way valve connected to a vacuum pump. The 

30 solenoid regulates the helium flow, and is controlled by a timer relay which allows 
gated flow in pulses down to 25 milliseconds. The PIG is operated from a control 
panel with a timer relay, a digital display vacuum gauge and a fire-button. 

The vacuum chamber is constructed from 3 mm stainless steel with a 20 mm acrylic 
35 door and a 10 mm inner lining of polypropylene with grooves at every 20 mm for a 
shelf. A silicone foam gasket between the chamber and the door seals the vacuum 
chamber. 
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A ballistic transformation device, similar to the PIG, was used in a preliminary 
experiment to test the feasibility of transformation of guar endosperms by particle 
bombardment. 

Preparation of guar tissue for transformation 

The primary target for particle bombardment in this study was the guar endosperm, 
but both emerged and pre-emerged cotyledons, as well as juvenile leaves, were also 
used as comparative tissue. Endosperms and emerged cotyledons were isolated 
from 3-4 weeks old guar pods, pre-emerged cotyledons were harvested from 12 and 
20 day old in vitro grown seedlings, and juvenile leaves were harvested from mature 
guar plants. 

Isolat ion of guar tissue 

Endosperms were isolated from guar seeds, under aseptic conditions, after the 
following procedure: 

20 The sterilised guar pods were opened, and the seeds taken out. For each seed, the 
coat was carefully removed, and the remaining part was divided in endosperm and 
embryo. The endosperm was divided in two identical halves, and the convex dorsal 
side of both halves was the target surface for particle bombardment. This bisection 
and orientation minimised dispersal of the tissue by the helium blast. The pre- 

25 emerged cotyledons were separated from the embryo, whereas emerged cotyledons 
were isolated from seedlings grown in vitro in a 12 hour day/night regime. Young 
guar leaves were excised from full-grown plants. 

The isolated tissue was transferred to the 60 mm petri dishes containing solid 
30 agarose medium and a filter paper disc (see section 1.4.3). The tissue were placed 
as follows: Endosperms in the centre of the petri dish with dorsal side up; cotyledons 
from seedlings with the upper side facing up in the centre of the petri dish; Embryo 
cotyledons together with endosperms. 

35 A particle bombardment experiment consisted of 5 to 6 repeated bombardments per 
construct. 6-8 endosperms, 2-3 cotyledons from seedlings, or 6-7 endosperms plus 
4-5 pre-emerged cotyledons were bombarded in each discharge of the PIG. For 
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comparative analysis, both cotyledons and endosperms were bombarded in same 
experiment. 

Sterilisation of guar tissue 

5 

No antibiotics were used in the transformation, so microbial contamination must be 
avoided. Therefore a fastidious sterilisation procedure was very important. 

Sterilisation of guar pods from mature guar plants were accomplished as follows: The 
10 pods were washed in sterile water, submerged for 1 min in 96 % EtOH, and finally 
washed 3 times with sterile water. Further sterilisation of the guar pods and seeds 
was not necessary. 

Sterilisation of new leaves from mature guar plants was accomplished as follows: 
15 The leaves were washed in sterile water, submerged for 10s in 96 % EtOH and for 
20 min in 0.15 % NaOCI and 0.05 % Triton X-100. Finally the leaves were washed 3 
times with sterile water, and stored in water until use. 

Cotyledons from in vitro grown seedling were sterile and no sterilisation was 
20 necessary. 
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The tissue was bombarded and incubated on the following medium. The medium 
was a solid medium, and was based on a recipe from Donovan & Lee 1977, with a 
5 few modifications in the concentration of the ingredients. 





Compound 


Concentration 






(mg/L) 


Salts 


Sucrose 


20,000 




M&S mix 1 


4,300 


Vitamins 


Myo-inositol 


100 




Thiamine 


0.4 


Aminoacids 


L-Alanine 


89 




L-Arginine 


126.5 




L-Asparagine 


117 




L-Aspartic acid 


100 




L-cysteine 


69 




L-glutamic acid 


664.5 




L-Glutamine 


665.5 




Glycine 


156 




L-Histidine 


112 




L-lsoleucine 


174 




L-Leucine 


274.5 




L-Lysine 


77 




L-Methionine 


67 




L-Phenylalanine 


287.5 




L-Proline 


443.5 




L-Serine 


202.5 




L-Threonine 


96 




L-Tryptophane 


164 




L-Tyrosine 


144 




L-Valine 


161.5 


Agar 


Agar 


8,000 




type A 1296 



1 Murashige & Skoog basal salt mixture M5524 
10 The particle bombardment procedure 

The particle bombardment experiments in this study had been performed as 
described below. 
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DNA coating of gold particles 

A typical transformation with a single plasmid, comprising five repeated discharges of 
the PIG, employed 3 mg of gold particles. In practice, a larger batch of gold particles 
than this was freshly prepared at one time, and used for several transformations 
immediately afterwards. The procedure described below for the preparation of 3mg 
of gold particles was adjusted such that amounts and volumes increased in 
proportion to amount of material 

1. 3 mg of gold particles (1.6^m, Biorad) were treated with 50 jiL 99.9 % EtOH 
(Danisco Distillers). 

2. The gold particles were vortexed in the ethanol for 3 min (full speed) and 
centrifuged at 10.000 g for 1 mih. 

3. The s upematant was removed and resuspended in 50 juL of sterile H2O ; 

4. The particles were centrifuged at 10.000 g for 1 min, the supernatant was 
removed and this washing with water repeated. 

5. The particles were resuspend in 50 50% w/v glycerol (sterile). 

20 The gold particles were then coated with plasmid DNA in the following manner. 

1. 50 of gold particle suspension were removed during vortexing (14.000 min" 1 ) 
on an thermomixer (Eppendorff). 

2. While vortexing (14.000 min" 1 ) this 50 sample of gold particles, the following 
25 components were sequentially added. 



5 



10 



10 \iL of plasmid (10 |ig/ML) 

50 *iL 2.5 M CaCI 2 

20 fiL 0.1 M Spermidine 

30 

3. Vortexing was continued for 3 min at 16.000 min 1 The sample was t 
sedimented at 10.000 g for 10 s. and the supernatant was removed. 

4. The sample was resuspended in 250 99.9 % EtOH, and centrifuged at 
same force for 10s. 

35 5. The supernatant was removed and the particles resuspended in 40 99J 
EtOH. 
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6. The DNA coated microprojectiles were stored at ice until use. 

Because the gold particles tend to agglomerate irreversible in aqueous solutions, it 
was necessary to prepare them immediately prior to use [Kikkert 1993]. CaCI 2 can be 
5 store at 4°C, but spermidine (N-[3-aminopropylJ-1.4-butanediamine) deaminates with 
time, and solutions must be freshly made every month and be stored frozen at -20°C. 

The purpose of the spermidine in the coating procedure is to condense the DNA by 
shielding the negative charges on the DNA phosphate backbone, thereby allowing 
10 hydrophobic interactions to predominate [Bloomfield 1991 J. 

Operating the PIG 

The PIG was operated in accordance with the following procedure 



I5 



1 . A petri-dish (60 mm in diameter) with the target tissue was placed in the vacuum 
chamber. 

2. A stainless steel mesh (250 nm-mesh, 13 mm in diameter) was placed in the filter 
unit. 

20 3. 5 nL of a suspension of DNA-coated micro-projectiles were loaded on to the 
centre of the steel mesh. 

4. The filter unit was assembled and attached to the helium solenoid valve. 

5. The 250 nm-mesh stainless steel protection screen was placed on top of the 
petri-dish. 

25 6. The door and the valves were closed and evacuation of the chamber was begun. 

7. The helium burst was released by pressing the fire button exactly at the point 
when the desired vacuum was reached. 

8. The vacuum was released by opening the evacuation port, and the tissue was 
removed . 

30 9. The cycle was repeated until all the samples had been bombarded. The filter unit, 
mesh and protection screen were changed for every new construct. 
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Bombardment conditions 

Good results for ballistic transformation of guar endosperms, embryos, and 
5 cotyledons were obtained with the following bombardment conditions. 

• The vacuum chamber was evacuated to a partial vacuum of 0.1 bar . 

• The target was placed at a distance of 16 cm, measured from the mesh in the 
filter unit to the target tissue. 

• The helium pressure on the regulator was set at 5 bar. 

• The timer relay was set at 50 milliseconds. 

• A stainless steel protective screen (250 >im-mesh) was placed approx. 2 cm 
above the target tissue 

• Gold micro-projectiles with a diameter of 1.6 Lim (Biorad) were used to carry the 
1 5 plasmid DNA. 

Post bombardment 



After bombardment the petri-dishes, containing bombarded tissue, were sealed and 
incubated for 48 hour at 25°C. Endosperms and embryos were incubated in the dark, 
leaves and cotyledons were incubated in a 12 hour day/night regime. 

GUS assay 



25 The histochemical GUS assay was performed as described by Jefferson 1987. After 
the incubation period the bombarded tissues were transferred to microtitre plates, 
submerged in GUS-assay buffer and incubated in dark for 24 hours at 37°C. 

GUS-assay buffer: 

30 

100 mM Sodium-phosphate buffer, pH 7.5 
0.5 mM Potassium ferricyanide (III) 
0.5 mM Potassium ferrocyanide (II) 
10 mM Na 2 EDTA (Titriplex III) 
35 1.9mMX-Gluc 
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A method for determination of GUS expression, and thereby promoter activity, by 
counting the number of blue spots was adapted from Knudsen & Muller [1991]. After 
bombardment, incubation and GUS-assay the number of blue spots (expression 
units) was counted under a microscope, and expressed relative to the area 
5 examined. 

The cotyledons and leaves were cleared in 96 % EtOH in several washing steps, and 
were stored in 70 % EtOH until counting of blue spots. 

10 Blue spots in endosperms had to be counted immediately after the GUS incubation 
period. Neither storage in alcohol nor water were applicable, due to high content of 
galactomannan and water. The water caused swelling of the endosperm and ethanol 
made the endosperm shrink. These changes destroyed the expression pattern on the 
surface of the endosperm. 

15 

Presence of sucrose synthase in guar endosperm 

A typical developmental increase in sucrose synthase, that coincides with seed-fill, 
was verified in extracts from guar endosperm using a continuous sucrose synthase 
20 assay (see Figure 11). Details on the sucrose synthase assay are as follows. 

Sucrose synthase assay 

The sucrose synthase activity in developing guar endosperms was assayed in the 
25 direction of sucrose cleavage. The sucrose synthase catalyses the cleavage of 
sucrose in the presence of UDP into fructose and UDP-glucose: 



sucrose + UDP c=> fructose + UDP-glucose 



30 The activity of sucrose synthase was assayed by monitoring the formation of UDP- 
glucose in a continuous enzyme reaction described by Keller et al. (Keller F, Frehner 
M & Wiemken A (1988) Sucrose Synthase, a Cytosolic Enzyme in Protoplasts of 
Jerusalem Artichoke Tubers (Helianthus tuberosus L.) Plant Physiol. 88, 239-241). 
The formation of UDP-glucose was coupled to the reduction of NAD* in the presence 

35 of UDP-glucose dehydrogenase (E.C. 1.1.1.22), which catalysis oxidation of UDP- 
glucose to UDP-glucuronic acid: 
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UDP-glucose + 2 NAD* + H 2 0 co UDP-glucuronic acid + 2 NADH 

The endosperms were dissected from the developing seeds, pooled and 
homogenised in Hepes/KOH buffer (20 mM. pH 8.00) using a rotating pestle. During 
5 this procedure enzymatic breakdown of the galactomannan backbone was effected 
by addition of p-mannase (E.C. 3.2.1.78, Megazyme). The sample was centrifuged at 
20,000 g for 20 minutes and the resulting supernatant was desalted using 
Sephadex® G-25 columns (NAP™-5, Pharmacia Biotech), for removal of endogenous 
sugars from the crude extract. 

I0 

An appropriate amount of enzyme extract was added to the sucrose synthase assay 
buffer (1 ml): 

20 mM Hepes/KOH (pH 8.0) 
15 200 mM sucrose 



2 mM UDP 
1.5 mM NAD* 

20 mU UDP-glucose dehydrogenase (Sigma) 

20 The reduction of NAD* to NADH was followed by continuous measurement of the 
absorbance at 340 nm in a spectrophotometer (25°C, e NAD H = 6300 I mol' 1 cm* 1 ). The 
amount of UDP-glucose produced (^mol min 1 ) was calculated from the slope of the 
resulting absorbance curve. The total amount of protein, in the remaining 
supernatant, was determined using the Biorad Protein Assay with bovine serum 

25 albumin-^s-staodard, and the specific activity of sucrose synthase was calculated 
(Units sucrose synthase per mg protein, or jamo! min" 1 mg" 1 ). 

The activity of sucrose synthase was measured in extracts from guar endosperms at 
various developmental stages from 7-41 days after flowering. The mean sucrose 
30 synthase activity for each developmental stage was obtained from 3 to 4 independent 
measurements, each of which was made with extract obtained from at least 5 
endosperms from same pod. 
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RESULTS 

Isolation of the RSus3 promoter region 

5 

The rice sucrose synthase 3 (RSus3) promoter region was isolated from rice 
genomic DNA using the chromosome walking technique described by Siebert et al. 
1995. After the semi-nested PCR reaction, 3 bands appeared in the following 
reactions: Dral (=700 bp), Seal (=3000 bp) and Sspl (=1800 bp). 

10 

The 1800bp and 3000bp products were verified, as specific products obtained from 
the primer-pair AP2/RSus3, by single primer control reactions. 



TemplateVPrimer 


AP2 


RSus3 


AP2/RSus3 


Seal (buffer H) 






=3000 bp 


Sspl (buffer H) 






=1800 bp 



15 The result of this single primer control reaction shows that the 1800 bp and 3000 bp 
originated from specific amplification from the AP2/Rsus3 primer-pair and are not 
single primer products. Single primer product from the AP2 is most likely the result 
of amplification from two adaptors. 

20 The 1800 bp and 3000 bp products were cloned into pCR2.1-TOPO. Nine clones of 
the 1800 bp ligation and seven clones of the 3000 bp ligation were isolated. 
Restriction mapping with EcoRI showed that of these, four 1800 bp clones and six 
3000 bp clones had insert between the two EcoRI sites. 

25 The resulting pCR2.1 clones with 1800 bp and 3000 bp inserts were partially 
sequenced using M13 universal and reverse sequencing primers specific for pCR2.1- 
TOPO. Downstream sequences for the 1800 bp and 3000 bp clones were compared 
to the published part of the RSus3 sequence [Huang et al. 1996 (GenBank accession 
number: L03366)] using the alignment feature in the Winseq program. 

30 

All four 1800 bp clones were shown to have the upstream part of transcribed region 
of RSus3 inserted (two clones of each orientation). These clones were named pSsp 
A2 t K1, K3 and K4. 
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Four 3000 bp clones were also shown to have the upstream part of the transcribed 
region of RSus3 inserted (with both orientations represented). These clones were 
named pSca K3, K5, New1 and New5. 

5 

See Figure 2. 

Presentation of sequence data 

10 The clone pTBR-ScaK3 (pScaK3) was chosen for total sequencing of the RSus3 
promoter region. This clone was sequenced on both strands after the sequencing 
plan outlined in figure 12. 

The other positive RSus3 clones described above was partially sequenced. The 
15 sequencing of the RSus3 clones resulted in determination of 2700 bp upstream the 
ATG codon (see SEQ ID No. 1). 

DNA sequence analysis 

20 The sequence for the RSus3 promoter region was analysed for the presence of 
putative cis-elements involved in the regulation of gene expression. The region 
upstream of the translational start codon was examined for the conserved TATA and 
CAAT consensus sequences, and the region was examined for the presence of 
putative endosperm or seed specific elements. 

25 

Additionally the region between the TATA box and the translational start codon was 
examined for the presence of intron donor and acceptor splice sites. 

The translational start codon was chosen as basis for assigning bases in the RSus3 
30 promoter, i.e. the A in ATG was numbered +1 (E.g. the base-pairs around the 
CA ATG G are numbered -2, -1, +1. +2, +3 and +4). 

Alignment of the RSus3 promoter region, with the promoter region region from 
RSusI (GenBank accession number: X59046) and from RSus2 (GenBank accession 
35 number: X64770), gave no obvious similarity. 

The results of the DNA sequence analysis are summarised in Table 1 (see earlier). 
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The GCN4 box and the three endosperm boxes are cis-elements involved in 
endosperm specificity and the legumin boxes are cis-elements involved in seed 
specificity. Additionally the -1072 bp endosperm box and the GCN4 motif constitutes 
5 a putative prolamin element, which also is involved in endosperm specificity, although 
the distance between the endosperm box and GCN4 motifs in RSus3 is greater than 
this (34 bp), the sequence shows high similarity to the consensus sequence. 

The GCN4 box, the two endosperm boxes and the majority of the legumin boxes are 
10 located within 700 bp spanning from -1023 bp to -1707 bp. This 700 bp sequence are 
located just upstream the TATA box. 

A restriction map of the RSus3 promoter region is presented herein as Figure 3. 
15 Presentation of RSus3/GUS/NOSt constructs 



In this study, various RSus3 promoter sequences were fused with the uidA gene for 
generation of a series of RSus3/GUS expression cassettes. Specific deletions of the 
RSus3 promoter were constructed in an attempt to identify those parts of the 
20 promoter that are involved in tissue-specificity. A series of tandem-repeat promoters 
were constructed in an attempt to" enhance the promoter activity, and the intron 1 
was removed from the promoter region, in order to establish whether it is necessary 
for expression in guar endosperm. 

25 The c onstructs w ere generated either by amplification of specific fragments by PCR 
or by subcloning. For verification of the resulting constructs, control restriction 
digests, typically with Hindlll, Nhe I, and Ncol were performed. Additionally, control 
PCR or sequencing reactions were performed in some cases. 

30 Common features in all variants of the RSus3 promoter were, translational fusion to 
the uidA gene, presence of the TATA box, and conservation of region downstream of 
this (with the exception of the single construct from which the intron was removed). 
The series of RSus3 promoter constructs centred around the upstream 700 bp 
region distal to the TATA box. As described above, this sequence contains the 

35 majority of putative cis-elements involved in specificity. 

The various RSus3 promoter constructs are summarised in figure 13. 
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p2700 construct (plasmid size 7938 bp) 

An RSus3 promoter of 2700 bp was initially constructed. This construct constituted 
5 the full-length RSus3 promoter, and was prepared by amplification of a 2700 bp 
Xhol/Ncol fragment from pTBR-ScaK3 into Sall/Ncol in pGUSNOSt. Positive clones 
were verified by Ncol/Hindlll digests, which resulted in 2 specific fragments, of 1160 
bp and 1540 bp, in addition to the 5230 bp Ncol/Hindlll fragment from the 
pGUSNOSt part. 

10 

p1730 construct (plasmid size 6963 bp) 

This 5' truncation of RSus3 was constructed by generation of a 571 bp Nhel/Hindlll 
fragment spanning from -1160 bp to -1730 bp. After changing of the upstream 
15 Hindlll site to a Nhel site in p2700, the Nhel/Hindlll fragment generated was 
subcloned into this p2700(upNhel). Positive clones were identified by an Nhel/Hindlll 
control digest, and this resulted in a specific fragment of same size as the subcloned 
fragment. 

20 p1670 construct (plasmid size 6907 bp) 

This 5' truncation was analogous to p1730, comprising a 515 bp instead of 571 bp 
Nhel/Hindlll fragment, positive clones of which were identified in the same way. 

25 p1560 construct (plasmid size 6791 bp) 

This 5* truncations was also analogous to p1730, comprising a 400 bp instead of 571 
bp Nhel/Hindlll fragment, positive clones of which were identified in the same way. 

30 p1450 construct (plasmid size 6704 bp) 

This construct was analogues to p2700, with ligation of a 1450 bp Xhol/Ncol 
fragment amplified from pSspK3 template, instead of the same amplified from 
pScaK3. A Ncol/Hindlll digest resulted in two specific bands of 1160 bp and 290 bp, 
35 besides the 5230 bp pGUSNOSt fragment. 

p1160 construct (plasmid size 6390 bp) 
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This truncated promoter constitutes removal of 1540 bp upstream the internal Hindlll 
site, and was generated by cutting p2700 with Hindlll, followed by religation. Positive 
clones were identified by Hindlll digest, which gave a single fragment in positive 
5 clones. 

p2700-prolamin construct (plasmid size 7777 bp) 

This internal truncation of 161 bp was produced by amplification of a 1020 bp 
10 fragment comprising the TATA box to the ATG codon. This fragment was cloned in 
p2700(upNhel) as a Hindlll/Ncol fragment. Positive clones were verified by a 
Hindlll/Ncol digest, which resulted in a specific fragment of 1000 bp and a fragment 
of 6770 bp. 

1 5 pi 450-i ntron ( plasm id size 5837 bp) 

In this construct, the intron 1 was removed by exactly splicing of the putative intron 1 
acceptor and donor splice sites. This internal deletion of the intron "1 was generated 
by amplification of a region of p1450 which corresponded to RSus3 lacking the intron 
20 1. After cloning in pCR2.1-TOPO, an Xhol/Ncol fragment containing this was 
subcloned in in pGUSNOSt. A positive clone was identified by restriction digestion 
with Hindlll/Ncol, and the 3' promoter end was verified by sequencing. 

pTandem legumin (plasmid size 7536 bp) 

25 

This tandem repeat promoter was constructed by generation of a 570 bp Nhel 
fragment spanning from -1160 bp to -1730 bp by PCR. This fragment was 
subcloned in the Nhel site in p1730. A positive clone was identified by control 
digests and a control PCR reaction with NhedeH/Dellowl verified the orientation. 
30 The right orientation resulted in two bands of 650 bp and 1300 bp, whereas the 
reverse orientation only resulted in a 650 bp product. 

pTandem leg/pro (plasmid size 7704 bp) 

35 This tandem-repeat promoter was constructed by PCR amplifaction of a 740 bp Nhel 
fragment spanning form -990 bp to -1730 bp. This fragment was subcloned in the 
Nhel site in p1730. Positive clones were first identified by an Nhe I digest, and those 
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of appropriate orientation were determined by Hindlll digest. This was possible due 
to the internal Hindlll site in the promoter, which was duplicated in this construct. 
Correct orientation resulted in a 740 bp Hindlll fragment, whereas the reverse 
orientation resulted in a 1140 bp Hindlll fragment. 

5 

pTandem prolamin (plasmid size 7135 bp) 

This tandem-repeat promoters was constructed by generation of a 170 bp Hindlll 
fragment, spanning from -990 to -1160 bp by PCR, which was subcloned in the 
10 Hindlll site in p1730. A Positive clone were identified by control digests, and the right 
orientation was determined by control PCR. Amplification with primer pair 
Nhedel1/HindGCN4 resulted in two bands of 600 bp and 750 bp for clones with right 
orientation, and to bands for of 300 bp and 750 bp for clones of reverse orientation. 

15 See Figure 15. 

Transient expression experiments 

The RSus3 expression cassettes described above were used to effect a transient 
20 transformation of guar tissue using the particle bombardment technique. 

These experiments served several purposes. Firstly, for testing the usefulness of the 
method for generation of transient transformed guar tissue. Secondly, the purpose 
was to set up and optimise the Particle Inflow Gun (PIG) in Danisco Biotechnology, 
25 Holeby. Finally, the main purpose was to evaluate the strength and specificity of 
promoters, such as RSus3, in guar tissues. 

Control 

30 By way of a control experiment, a control plasmid comprising the uidA gene fused to 
the ENOS promoter was constructed. It is known that the ENOS promoter has high 
levels of expression in both endosperm and leaf tissue. 
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Transient expression of RSus3/GUS/NOSt in guar tissue 

A total of 1 1 sets of particle bombardment experiments were performed for testing 
the promoter strength and specificity of the RSus3 promoter region. The above 
5 described RSus3/GUS/NOSt constructions had been tested in this period. The 
procedures for the bombardment, for the preparation of guar tissue and for the 
coating of gold particles are described above. 

The first approach was to test whether the RSus3 promoter was active in guar 
io endosperm or not. p2700 and p1450 were bombarded into guar endosperms, and 
the result showed significant expression of GUS in endosperm for both, constructs, 
although the expression was lower than for the ENOS control. These results were so 
promising that the subsequent work was concentrated around this promoter. 

15 I h „e second appro RSus3 was specific for the endosperm 

or whether it had expression in other parts of the plant. The initial experiment, which 
was performed on cotyledons from in vitro grown seedlings, showed differences in 
expression between p2700 and p1450. p2700 had almost no expression of GUS in 
cotyledons, whereas some cotyledons transformed with p1450 displayed significantly 

20 higher expression. 

To test whether this expression pattern for p2700 and p1450 was the same in leaves, 
these constructs were also used to perform transient transformation of juvenile 
leaves from mature plants. This transformation turned out to be difficult due to 

25 severe damage of the leaf tissue caused by the helium blast Not only was the 
expression of RSus3 concentrated in the vascular tissue of the leaves, but also the 
ENOS control showed same expression pattern indicating that optimisation of the 
procedure for leaves were necessary. Due to limitations in time, however, these 
optimisation experiments were abolished after a while, and instead it was chosen 

30 exclusively to bombard cotyledons from seedlings and from seeds. 

The transient GUS expression, directed by the two RSus3 promoter constructs, 
p2700 and p1450, justified further analysis of the promoter strength and tissue 
specificity. As described above a series of deletions of the RSus3 promoter were 
35 constructed and these were tested in guar endosperm, guar embryo cotyledons and 
cotyledons from 12 and 20 day old seedlings. Additionally, in an attempt to enhance 
the promoter activity, 3 tandem promoters were constructed and the resulting 
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constructs were also tested in same tissue for evaluation of promoter strength and 
specificity. 

Figure 16 presents transient GUS expression data from the control ENOS promoter, 
5 the RSus3 promoter construct p1730 and the RSus3 promoter construct pTandem 
leg/pro (tandem repeat). The results, which are in accordance with the findings for 
the other RSus3 promoter constructs, have been discussed above. These results 
show selective endosperm expression of an NOI. 

10 The results of transient GUS expression in guar endosperms, after bombardment 
with RSus3 constructs, are summarised in figure 17. 

The data show that the promoter of the present invention causes selective 
expression of the NOI in endosperm. In addition, a tandem repeat of the promoter of 
15 the present invention causes an increase in the expression levels. 

As can be seen at figure 17 the expression in guar endosperm are only slightly 
affected by deletion of the sequence between 2700 bp and 1160 bp. The 1160 bp 
has still significant expression indicating that the cis-elements which caused the main 
20 part of expression in guar endosperm are present between 1160 bp and the TATA 
box (+989 bp). This part has been removed in the p2700+prolamin construct, and 
this result in a loss of 25 % of the promoter activity compared to p2700. 

pTandem leg/pro gives highest expression in endosperms, 3 times higher than the 
25 2700 construct and the level of expression is comparable to NOS expression in 
endosperms. 

No blue spots were observed when GUAR tissues were bombarded with 
microprojectiles treated in the same manner, only without DNA, and when the PIG 
30 was discharged without microprojectiles. Bombardment with the promoterless GUS 
construct (pGUSNOSt) gave only a few blue spots, presumably caused by 
illegitimate recombination into the chromosome adjacent to a promoter. 

For comparative analysis of expression in endosperms versus cotyledons, values 
35 relative to the area of the bombarded tissue were calculated. An average 4 weeks 
old endosperm comprises an area of 0.2 cm 2 , and the same for an average pre- 
emerged cotyledon. An average 12 days cotyledon covers an area of 1.5 cm 2 , and 
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an average 20 clays cotyledon an area of 2.8 cm 2 . Therefore the number of blue 
spots per 12 day or 20 days cotyledon were corrected to number of blue spots per 
0.2 cm 2 . 

5 Figure 18 shows that the only highly significant change in promoter strength was 
displayed by the Tandem leg/pro variant. Although this construct directed a level of 
GUS expression which was three-fold larger than that of the parent construct p2700, 
it retained specificity for the endosperm. Levels of expression in cotyledons was 
equally low with the Tandem leg/pro variant as it was with p2700. 
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CO-TRANSFORMATION 



The largest component of variance in data obtained by measurements of transient gene 
5 expression, following ballistic transformation, resides in the differences in the efficiency of 
delivery of the DNA to the target. It has been shown that one can compensate for this 
problem," and obtain more precise evaluation of transient gene expression, by co- 
transformation with a control expression vector containing a different reporter gene (Godon, 
C, Caboche, M, Daniel-Vedele, F. (1993) Biochimie 75: 591-595). By measuring the 
10 reporter activities and expressing results as the ratio of the test activity to that of the control, 
fluctuations that merely result from differences in the amount of DNA striking the target are 
accounted for. 



The RSus3 promoter variants, were also tested in this manner. The firefly luciferase gene 
15 - (luc) was fused to each of them, as a reporter, and ballistic transformation was repeated as 
described before, but two control plasmids were included in equal molar amounts to the test 
plasmid. The strength and tissue-specificities of the different promoter constructs were tested 
in modified transient expression studies of ballistically transfomed endosperms, and pre- 
emerged cotyledons, from the same guar seeds. Each experiment involved simultaneous 
20 transformation with three separate plasmids. Of these, one was a test plasmid (p2700 + 
intron/luc+, p2700 - intron/luc+, pTandem leg/pro + intron/luc+, or pTandem leg/pro - 
intron/luc+ or pDB16/luc+), in which promoters were fused to the firefly luciferase gene 
(luc+) (Figures 19, 20, 21, 22, and 23). The remaining two were control plasmids, in which 
either the uidA (GUS) gene or the renilla luciferase gene (pDB16/Rluc) were under the 
25 control of the NOS promoter (Figures 24 and 25). 2u.g of test plasmid were mixed with 2u.g of 
each control plasmid, and coating of particles with this DNA, preparation, bombardment, and 
subsequent incubation of tissue, were all performed as described above. 

Enzyme extracts of bombarded tissues were made by grinding these in liquid nitrogen, and 
30 extracting the frozen macerate with a buffer containing boric acid, that prevents gelling of the 
galactomannan in cryopreserved material. Approximately 400mg of each tissue were 
extracted with 800u1 of extraction buffer (50mM NaH 2 P0 3 , 50mM H 2 B0 3 , ImM 
dithiothreitol, ImM EDTA disodium salt, lOmg/ml bovine serum albumin, pH was adjusted 
to 7.0 by addition of sodium hydroxide). Samples were centrifuged at 12,000 g, at 4°C, for 
35 2minutes, and the resulting supernatants were used for analysis. 
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The activities of all three reporter genes were measured luminometricaliy using a Turner TD- 
20/20 luminometer (Turner Designs, Sunnyvale, CA , USA). The two luciferase acivities 
were measured sequentially in the same reaction mixture, after the 'Dual -Luciferase <g)' 
method described in Promega Technical Manual No. 40 (Promega Corporation, Madison, WI, 
5 USA). 20uJ of guar tissue extract were rapidly mixed with IOOjiI of LARII reagent in the 
apparatus, which after 2 seconds measured and integrated the light evolved over the 
subsequent 10 seconds. Immediately after this, lOOul of 'Stop and GIo®* reagent were added 
to the reaction mixture and the renilla luciferase luminescence was measured in the same 
manner. 

10 

GUS activity was measured according to the GUS-Light™ system instruction manual 
(TROPIX Inc., Bedford, MA, USA). 20ul of guar tissue extract were added to 1 80ul of GUS 
reaction buffer, and incubated at 25 D C for one hour. The tube was placed in the luminometer, 
300ul of Might emission accelerator' were added, and after 2 seconds the light emitted during 
T5 tHe subsequent 1 

The values obtained are summarised in the table below. 

Relative Integrated Light Signal 

20 



Rsus3/luciferase experiment 


Dual Luciferase^ Reporter Assay 
system (Promega) 


GUS-Light™ 
(Tropix) 




Endosperm 


Cotyledon 


Endos 
perm 


Cotyle 
don 




luc+ 


RIuc 


luc+ 


RIuc 


GUS 


GUS 


p2700+intron/luc+ 
(pDB16/Rluc and pDB16/GUS) 


7,7/ 


352,0 


1,2 


248,7 


194,5 


208,2 




7,6 


350,4 


0,8 


236,4 


165,3 


192,4 




7,6 


353,2 


0,8 


256,7 


189,8 


206,7 


P2700-intron/luc+ 

(pDB16/RIuc and pDB!6/GUS) 


20,9 


574,0 


0,6 


185,2 


269,7 


215,0 




20,9 


585,1 


0,5 


179,3 


252,1 


151,3 




21,2 


596,7 


0,5 


195,1 




228,0 
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PTandem (leg/pro)+intron/luc+ 
V^puo i o/Kiuc and puts L o/OUo) 


13,9 


249,5 


1.1 


209,2 


93,4 


166,4 




13,3 


248,8 


0,9 


202,3 


99,6 


142,0 




13,3 


247,5 


0,8 


206,6 


104,9 


182,7 


pTandem (leg/pro)-intron/luc+ 
(pDBlo/Rluc and pDB16/GUS) 


42,8 


570,1 


1.7 


205,1 


249,2 


172,3 




43,2 


591,8 


1.5 


202,6 


225,2 


156,3 




AA *7 

44, / 


O04,J 


1 c 

1.5 


213,9 




167,2 


pDB16/luc+ 

(pDB16/Rluc and pDB16/GUS) 


45,3 


565,5 


22,3 


155,5 


289,8 


206,6 




43,4 


582,0 


21,4 


176,2 


205,8 


191,8 




45,1 


623,4 


19,9 


172,4 


333,2 


206,4 



The same results are expressed in Figure 26 as 'relative strength of promoter', the value of 
which is calculated by the division of two ratios, the test ratio and the control ratio. The test 
ratio was calculated by dividing the luminescence generated by the firefly luciferase, under 
5 control of test promoter construct, by the value obtained from the control reporter gene (either 
uidA or Rluc) under the control of the NOS promoter. The control ratio was calculated by 
dividing the luminescence from the firefly luciferase by the value from the control reporter 
gene (either uidA or Rluc) when both genes were under the control of the NOS promoter. 
* Relative strength of promoter* was calculated by dividing the test ratio by the control ratio 
10 and multiplying the value obtained by one hundred. 

The res ults in Figu re 26 show a pattern that mirrors that shown in Figure 17 insofar as the 
Tandem leg/pro configuration enhances the activity of the promoter while retaining tissue 
specificity. Although it appeared from the earlier results that the Tandem leg/pro promoter 
15 construct was twice as strong as the NOS promoter strength in endosperm tissue, the results 
in Figure 26 show that these two promoters in fact display similar strength in this tissue. 
These results also show that removal of the intron from constructs seems to enhance promoter 
strength in guar tissue. This is a preferred aspect for some embodiments of the present 
invention. Hence, preferably the NOl does not comprise an intron. 

20 
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SUMMARY 



In summation, the present invention relates to a promoter and, also to a construct 
5 comprising the same. In particular the present invention relates to the use of a 
promoter for the expression of a NOI in an organism. 

All publications mentioned in the above specification are herein incorporated by 
reference. Various modifications and variations of the described methods and system 

10 of the present invention will be apparent to those skilled in the art without departing 
from the scope and spirit of the present invention. Although the present invention 
has been described in connection with specific preferred embodiments, it should be 
understood that the invention as claimed should not be unduly limited to such specific 
embodiments. Indeed, various modifications of the described modes for carrying out 

15 the ..invention which are obvious to those 

related fields are intended to be within the scope of the following claims. 
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uVDTCATIONS RELATING TO A DEPOSITED MICROORGANISM 
(PCT Rule \3bis) 



A. The indications made below relate to the microorganism referred to in the description 
on page 55_ JineS 12-20 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet | [ 



Name of depositary institution 



The National Collections of Industrial and Marine 8acteria Limited (NCIMB) 



Address of depositary institution fmctuding postal codr and country) 

23 St Machar Drive 
Aberdeen 
AB2 1RY 
United Kingdom 



Date of deposit 

15 March 1999 




NCIMB 41011 


C ADDITIONAL WmCATlOHS (leave blank inapplicable) This information iscont.nued on an additional sheet Q 



in respect of those designations in which a European patent is sought, and any other designated states 
having equivalent legislation, a sample of the deposited microorganism will only be made available (a) until 
the publication of the mention of grant of the patent or (b) for twenty years from the date of filing if the 
application has been refused, withdrawn or deemed to be withdrawn, by the issue of such a sample to an 
expert nominated by the person requesting the sample. (Rule 28(4) EPC) 



P. DESIGNATED STATES FOR WHICH INDICATIONS ARE M AD E (if the indications are not /oral! designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



For receiving Office use only 



[ | This sheet was received with the international application 



Authorized officer 



Form PCI7RO/134 (July 1992) 



For International Bureau use only 



I I This sheet was received by the International Bureauon: 



Authorized officer 
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CLAIMS 

1 . A promoter comprising a nucleotide sequence corresponding to that shown as SEQ 
5 ID No. 1 or a variant, homologue, fragment or derivative thereof. 

2. A promoter having a nucleotide sequence corresponding to that shown as SEQ ID 
No. 1 or a variant, homologue, fragment or derivative thereof. 

10 3. A promoter comprising a nucleotide sequence corresponding to that shown as SEQ 
ID No. 1. 

4. A promoter having a nucleotide sequence corresponding to that shown as SEQ ID 
No. 1. 

15 

si A promoter according to any one (^ claims 1 to 4 wherein the promoter is 
obtainable from a plant of the genus Oryza. 

6. A promoter capable of causing endosperm specific expression, wherein the 
20 promoter is obtainable from a plant of the genus Oryza. 

. 7. A promoter according to any one of claims 1 to 6, wherein the promoter is operably 

linked to a NOL 

25 8. A promoter according to any one of the preceding claims wherein the promoter is 
linked to the sequence presented as SEQ ID No. 2, or a variant, homologue, derivative 
or fragment thereof. 

9. A promoter according to claim 8 wherein if an NOI is operably linked to the promoter 
30 then the sequence presented as SEQ ID No. 2, or a variant, homologue, derivative or 

fragment thereof is located intermediate the promoter of the present invention and the 
NOI. 

10. A promoter according to any one of the preceding claims wherein the promoter 
35 comprises one or more of the identified sequences presented in Table 1 or a variant, 

homologue or fragment thereof. 
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1 1 . A promoter according to claim 10 wherein the promoter comprises one or more of 
the identified sequences presented in Table 1 . 

12. A promoter according to claim 10 or claim 11 wherein the promoter comprises all 
5 of the identified sequences presented in Table 1 

13. A promoter according to any one of the preceding claims wherein the promoter is 
linked to the sequence presented as SEQ ID No. 5, or a variant, homologue, derivative 
or fragment thereof. 

10 

14. A promoter according to claim 13 wherein if an NOI is operably linked to the 
promoter then the sequence presented as SEQ ID No. 5, or a variant, homologue, 
derivative or fragment thereof is located intermediate the promoter of the present 
invention and the NOI. 



15. A construct comprising the promoter according to any one of claims 1 to 14 but 
wherein the promoter is operably linked to a NOI. 

16. An expression vector comprising the invention according to any one of claims 1 to 
20 15. 

17. A transformation vector comprising the invention according to any one of claims 1 
to 16. 

25 18. A transformed host or host cell comprising the invention according to any one of 
claims 1 to 1 6. 

19. A transformed host or host cell according to claim 18 wherein the host or host cell 
is a plant or a plant cell. 

30 

20. A method of preparing a POI, the method comprising expressing an NOI which 
encodes at least a part of the POI t wherein the NOI is operably linked to a promoter, 
optionally isolating the expression product of the NOI, forming the POI if the expression 
product of the NOI is not ali of the POI, optionally isolating the POI; wherein the 

35 promoter is the promoter according to any one of claims 1 to 19. 

21 . A method according to claim 20 wherein the NOI codes for all of the POI. 
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22. A method for expressing an NOI in endosperm, the method comprising expressing 
the NOI when it is operably linked to the promoter according to any one of claims 1 to 
20. 

5 

23. A promoter sequence obtainable from Deposit No. NCIMB 4101 1. 

24. Any one of the plasmids presented in the Experimental Section provided herein. 

10 25. Use of a sequence presented as SEQ ID No. 2, or a variant, homologue, derivative 
or fragment thereof to increase expression levels of an NOI. 

26. A promoter substantially as described herein and with reference to any one of the 
preceding claims. 
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FIG. 3 



1 • • - • • . 

. AC T TT AGAT AAT AAAG T AAG T C AC AAGAAAAAT AAAT AAT AATTC C AAATTT TTTT AATA 
TGAAATCTATTATTTCATTCAGTGTTCTTTTTATTTATTATTAAGGTTTAAAAAAATTAT 

EamllOSI 

61 . . ... 

AGACGAGTGGTCAAACAGTACAAGTAAAAACTCAAAATTCCTTATATTATGGGACTTATA 
TCTGCTCACCAGTTTGTCATGTTCATTTTTGAGTTTTAAGGAATATAATACCCTGAATAT 



121 ..... 

TTATGGGACGGAGGAAGTAGAAGATTGTAGCCAAGAAAAAAACAAAAACAAACACACCGC 
AATACCCTGCCTCCTTCATCTTCTAACATCGGTTCTTTTTTTGTTTTTGTTTGTGTGGCG 

PpulOl 
Nsil 
SphI 

181 .- ,~ w ----- ; - 

CACCTGGCAGGCATGCATCTTAGGTCGGCACATTGAGAGGTCGGCAGTAGACGAGTTACC 
GTGGACCGTCCGTACGTAGAATCCAGCCGTGTAACTCTCCAGCCGTCATCTGCTCAATGG 



Eco57I Nhel 

CTACACAACTGCTTCTTCAGTGAGCTAGCTGCATGTTCTGTTCTGCATTTACATTGCAGG 
GATGTGTTGACGAAGAAGTCACTCGATCGACGTACAAGACAAGACGTAAATGTAACGTCC 

Nhel Clal 

301 

CAGCAGCTAGCAACAGTTTGCAGGAACAATCGATAATCCATTGTGTCAGGGAGGAACATG 
GTCGTCGATCGTTGTCAAACGTCCTTGTTAGCTATTAGGTAACACAGTCCCTCCTTGTAC 

PvuII 

Bpml Esp3I BsiWI Eco57I 

361 . . . 

GAGAAAAACCGGGGCTGGAGACGAACGGGAGCAGCTGTACCGTACGTTTCTGAAGGCTGA 
CTCTTTTTGGCCCCGACCTCTGCTTGCCCTCGTCGACATGGCATGCAAAGACTTCCGACT 

Eco57l 

421 ...... 

ACCCATCTGCGAAATCCGCAGATTGGTTTGTTCAATTCCAACTTGCAGTCCTTCAGATTG 
TGGGTAGACGCTTTAGGCGTCTAACCAAACAAGTTAAGGTTGAACGTCAGGAAGTCTAAC 



481 ...... 

GTTGCATGTTCAACCGTAGTACATCTGAAAAATGAAGTGTTAAATACCTTGAGAAGACCT 
CAACGTACAAGTTGGCATCATGTAGACTTTTTACTTCACAATTTATGGAACTCTTCTGGA 

PstI 

SphI Sse8387I BstBI 
541 . « * ... • 

TCATGGAAGCATGCCTGCAGGCGATTAGCTAAGAAAAAAAAAATAAATGTACTTTTCGAA 
AGTACCTTCGTACGGACGTCCGCTAATCGATTCTTTTTTTTTTATTTACATGAAAAGCTT 
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601 ...... 

ACTTAATTTTGGAGTTAGATTTTAGGGTGTTTCCATCGTAGTGTATTTTCTACTATTGCA 
TGAATTAAAACCTCAATCTAAAATCCCACAAAGGTAGCATCACATAAAAGATGATAACGT 

Dral 

661 ..... 
GTTTAAACCGCTAATAGTCAGATATAAAATTTTATCTATAGATCATTTATAAATCATTTT 
CAAATTTGGCGATTATCAGTCTATATTTTAAAATAGATATCTAGTAAATATTTAGTAAAA 

Muni 

Bell 

721 ..... 

TAGTTGCTTCGTTCATTTTTCTACCACTTATCAACCATAGCTCAACTGATCAATTGACAA 
ATCAACGAAGCAAGTAAAAAGATGGTGAATAGTTGGTATCGAGTTGACTAGTTAACTGTT 

781 ..... 

TAAAAGTTACTAAACGACATCGCTCATCACACACCCAACGCTCACCGATGGGTGCCTCTC 
ATTTTCAATGATTTGCTGTAGCGAGTAGTGTGTGGGTTGCGAGTGGCTACCCACGGAGAG 

BssSI BsaBI 
8.41 - , .., 

GACCACGAGTTTAGCACTTGTGCAACATATATGCGTGCGATGAACATCTACTGATGCGCC 
CTGGTGCTCAAATCGTGAACACGTTGTATATACGCACGCTACTTGTAGATGACTACGCGG 

PpulOI 
BlpI Nsil 
BspHI AlwNI Esp3I SphI 

901 . . . . . 

ATGCGAATTTTAGCGTTCGTTCATGACGCTTCCAACGGCACAGAGGCTGAGCAGCAGCAT 
TACGCTTAAAATCGCAAGCAAGTACTGCGAAGGTTGCCGTGTCTCCGACTCGTCGTCGTA 

PpulOI 
Nsil 
SphI 

961 ..... 

GCATGCATGGCTCTTGTGAAAACAAAAAAGGTTACTGGTAAATGACATGCTGCTGTAGCT 
CGTACGTACCGAGAACACTTTTGTTTTTTCCAATGACCATTTACTGTACGACGACATCGA 

Ndel 
PpulOI 
Bsml Nsil 
1021 . . . ■ . 

AGTTAGCAGAATGCAAGGCCCATGCATATGCAATGCTATGCAACAAGTATAGTACCAGCA 
TCAATCGTCTTACGTTCCGGGTACGTATACGTTACGATACGTTGTTCATATCATGGTCGT 

BssSI 

1081 . . . 

TGTATGGTAGCCAGCTAACTAATCTATCAGCAGAGGCAGCAAGCTCGTGCATGGTGTGAT 
ACATACCATCGGTCGATTGATTAGATAGTCGTCTCCGTCGTTCGAGCACGTACCACACTA 

Xcml 

Bpml Ndel 
1141 ...... 

GCACTTCTCTCCAGTAATCTAGTGGTAATTTTCACCCAAAGCGTTGCTCATATGGACAGT 
CGTGAAGAGAGGTCATTAGATCACCATTAAAAGTGGGTTTCGCAACGAGTATACCTGTCA 
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Sspl 

1201 

AATTAGTAATATTACCAAGGTTCACAATCCCGTTACCTGACCAAATACTACTCACGAATG 
TTAATCATTATAATGGTTCCAAGTGTTAGGGCAATGGACTGGTTTATGATGAGTGCTTAC 



1261 ...... 

GTATCTCTGGTTTTCGTTAAAACCGTTGGTAAACCAGCAAAAATAGACAAAATTTGTCAA 
CATAGAGACCAAAAGCAATTTTGGCAACCATTTGGTCGTTTTTATCTGTTTTAAACAGTT 

Dral 

1321 

AATTTTAAATTTTAGTTTTTTTTTTAACTTAGCCGGGAAACCTTGAAGTTTGTGCTGTCG 
TTAAAATTTAAAATCAAAAAAAAAATTGAATCGGCCCTTTGGAACTTCAAACACGACAGC 

Bsgl 

1381 ...... 

AGCTGTCCTGGGAAGGACGGTTTTGGTTGGGATTGTGAACCCTGGTTACTGCACTTCATT 
TCGACAGGACCCTTCCTGCCAAAACCAACCCTAACACTTGGGACCAATGACGTGAAGTAA 



.1.4.41. 



TTTGAACAGATATTAGTGCAACAGACAAATGCCAACGCATTTTTTTCTGTTTACCGGCAA 
AAACTTGTCTATAATCACGTTGTCTGTTTACGGTTGCGTAAAAAAAGACAAATGGCCGTT 

Hindlll 

Eco57I BspMI Esp3I 

1501 ...... 

GCTGAAGCTTTTACGATCCCCATACCGCCGTTGCTGCAAACCTGCCAAGAAAGAGCAGCA 
CGACTTCGAAAATGCTAGGGGTATGGCGGCAACGACGTTTGGACGGTTCTTTCTCGTCGT 



1561 ...... 

' G AAAC AGGTGTC AT TTTGTGGTG GAAAG C CAAG T AAAGT AAAC AGAAGAT GG AAG AT AGT 
CTTTGTCCACAGTAAAACACCACCTTTCGGTTCATTTCATTTGTCTTCTACCTTCTATCA 

Bsgl 

1621 : . ... 

GAGGACCAGGGAGTGAGGCAGGGGACACATGGCCCACGCCTCCCTGCACATTTTCGTGTA 

CTCCTGGTCCCTCACTCCGTCCCCTGTGTACCGGGTGCGGAGGGACGTGTAAAAGCACAT 
PpulOI 

Nsil Esp3I 
1681 ...... 

TAAATACAGGTGGATGCATCGCTCTCCCAGCATCCATCGGTTCTCTGCTCTGTTCATCCA 
ATTTATGTCCACCTACGTAGCGAGAGGGTCGTAGGTAGCCAAGAGACGAGACAAGTAGGT 

Earl Eco57I Earl 
1741 ...... 

TAGAGTTTCCTCCTCTTCTCCTTCAGTGCAAGGTAGAGAAGAGCATGTGTGTGTGTGTGT 
ATCTCAAAGGAGGAGAAGAGGAAGTCACGTTCCATCTCTTCTCGTACACACACACACACA 

Bsgl 

1801 ...... 

GTGAACTGTGAAGTGCAGAGTGCTTCTGTAGTTCTGTGTTATGTCCATAGTGATCTTGTT 
CACTTGACACTTCACGTCTCACGAAGACATCAAGACACAATACAGGTATCACTAGAACAA 
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PpulOI 
Nsil 

1861 • 

AGGATTGTTGCTATGGATGCATGATGTTATGGTTAATCTCTGAATTACAGTAGGGACTTC 
TCCTAACAACGATACCTACGTACTACAATACCAATTAGAGACTTAATGTCATCCCTGAAG 

Bglll 

1921 

TCTGAGATCTCTGGATTAGTGGGGGGTGCTAAATTTTTTTCTGGTTGCATCAGCTTGGGT 
AGACTCTAGAGACCTAATCACCCCCCACGATTTAAAAAAAGACCAACGTAGTCGAACCCA 

1981 ...... 

TTCTGGGATTGGTGTGGGTTCTTGCTCTGAATTTTGGTTCAGAATGTCGATTTGTTTTGT 
AAGACCCTAACCACACCCAAGAACGAGACTTAAAACCAAGTCTTACAGCTAAACAAAACA 

BspMI 

ECOS7I PstI 
2041 ...... 

GTTTGCCCTCTGAAGTTGAGAGTAGCTATGATCCATCCAGCACAGAACTGCAGGTCCCTG 
CAAACGGGAGACTTCAACTCTCATCGATACTAGGTAGGTCGTGTCTTGACGTCCAGGGAC 

NgoMI 
Nael 

2101 

CCTGCCGGCAGCATATACAGGACATGCCATTTTGCAAGCTCTGGGCTTATGGTTTCTCTT 
GGAC GG CC GTC GT AT ATGTCCTGT AC GGTAAAACGTTC GAG AC CCGAATACCAAAGAGAA 

2161 ..... 
TTGGAGTTCTTCTTCTTGCATGATCTGTGTTCTCTAACAAAGAAGCAAGATTTAGCAACT 
AACCTCAAGAAGAAGAACGTACTAGACACAAGAGATTGTTTCTTCGTTCTAAATCGTTGA 

2221 ...... 

TTATTCAGAGACAAGAAAAGGATCTGGCAACCTTTTGTTTCTGTTTTATCCTACTCGTAA 
AATAAGTCTCTGTTCTTTTCCTAGACCGTTGGAAAACAAAGACAAAATAGGATGAGCATT 

Dral 

2281 ...... 

AGATTGTTATTTAAGCAAAAATTTCCCAAAAGTTTTAAATATAATTTCCATGATGTGCCA 
TCTAACAATAAATTCGTTTTTAAAGGGTTTTCAAAATTTATATTAAAGGTACTACACGGT 

2341 ..... 

CTCTCATGTCCTTGAACCTGGCACTCATTATGGGCTCCTCAGAAGTGCTGTAGCTAATGT 
GAGAGTACAGGAACTTGGACCGTGAGTAATACCCGAGGAGTCTTCACGACATCGATTACA 

BsaBI 

2401 ...... 

CACTAATCTTTTGTATCTTTGTTCGTAGTCTTGTATTTTATGATGCTTATCCCTTTGTGC 
GTGATTAGAAAACATAGAAACAAGCATCAGAACATAAAATACTACGAATAGGGAAACACG 

2461 

• TTTCCATGTTTGATGTCCAAATGTCATGGCAATGTTTTTGACTTCTAGTAGGGGTTTTAG 
AAAGGT AC AAACTAC AGGT TT AC AGTAC CGT T AC AAAAACT GAAG ATC ATCCCCAAAAT C 
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2521 ... . 

' TACCTTTTTGTTAGATAAGTACATCCAAATTCTGTTTATTTATTCAAAAATCATTCTGTT 
ATGGAAAAACAATCTATTCATGTAGGTTTAAGACAAATAAATAAGTTTTTAGTAAGACAA 

PflMI 

2581 ...... 

TATTCACTGAAAACATTTGTCCATTCAATGGAATCGTAAACTGTCTGTGTTTTTCAGGCT 
ATAAGTGACTTTTGTAAACAGGTAAGTTACCTTAGCATTTGACAGACACAAAAAGTCCGA 

BamHI 

2641 . . 2668 

TGAGGATCCAACTAGAAGATAGCAATGG 
ACTCCTAGGTTGATCTTCTATCGTTACC 
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Agarose gel showing the result of the semi-nested PCR with the primer pair AP2 and RSus3. 
Lane I and 12 molecular weight marker II; lane 2-6 EcoRV, Dral, PvuII, Seal and Sspl with 
buffer F: lane 7-1 1 same with buffer H. 
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The promoter region from P ScaK3 and P SspK3 were amplified from the primers M 1 3 forward 
and RSusNco. The amplified products contain Xhol and Ncol sites for directional cloning into 
the unique Sail and Ncol in pGUSNOSt as Xhol/Ncol fragments 

FIG. 5 



BNSOOCID: <WO 007B975A2J_> 



SUBSTITUTE SHEET (RULE 26) 



Best Available Copy 



WO 00/78975 PCT/G BOO/02641 

8/27 




BNSDOC1D: <WO. 



^0078975A2J_> 



SUBSTITUTE SHEET (RULE 26) 



Best Available Copy 



WO 00/78975 PCT/CB00/02641 

9/27 




BNSDOCIO: <WO 007B975A2_I_> 



SUBSTITUTE SHEET (RULE 26) 



Best Available Copy 



WO 00/78975 PCT/G BOO/0264 1 

10/27 




BNSOOCIO: <WO 007B975A2J_> 



SUBSTITUTE SHEET (RULE 26) 



Best Available Copy 



WO 00/78975 PCT/G BOO/02641 

11/27 




BNSDOCID: <WO. 



.0078975 A2J_> 



SUBSTITUTE SHEET (RULE 26) 



Best Available Copy 



WO 00/78975 



PCT/GBOO/02641 



12/27 



2 

Qui 



> 

c o 



s 



0. 




o 

I 

Pi! 

B 



o 

CD 

^3 IVO 



o 
o 

g-,3 CM 



If 



312 
If" 



SIS 



SUBSTITUTE SHEET (RULE 26) 



BNSOOCIO: <WO_ 



_0078975A2J_> 



Best Available Copy 



WO 00/78975 PCT/G BOO/02641 

13/27 




007B975A2_I_> 



SUBSTITUTE SHEET (RULE 26) 



Best Available Copy 




007B975A2_I_> 



SUBSTITUTE SHEET (RULE 26) 



Best Available Copy 



WO 00/78975 PCT/GBOO/02641 

15/27 




Agarose gel showing the result of a digest of tandem repeat clones. Lane 1: Hindlll 
digest of pTandem leg/pro, which gives a specific fragment of 740 bp. Lane 2: Nhel 
digest of pTandem legumin, which gives a specific fragment of 570 bp. Lane 3: DNA 
marker VI. Lane 4: Hindlll digest of pTandem prolamin, which gives a specific fragment 
of 160 bp. 
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Transient GUS expression in GUAR endosperms 
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SEQUENCE LISTINGS 



5 SEQ ID NO. 1 



ACTTTAGATAATAAAGTAAGTCACAAGAAAAATAAATAATAATTCCAAATTTTTTTAATAAGA 
CGAGTGGTCAAACAGTACAAGTAAAAACTCAAAATTCCTTATATTATGGGACTTATATTATGG 
GACGGAGGAAGTAGAAGATTGTAGCCAAGAAAAAAACAAAAACAAACACACCGCCACCTGGCA 

10 GGCATGCATCTTAGGTCGGCACATTGAGAGGTCGGCAGTAGACGAGTTACCCTACACAACTGC 
TTCTTCAGTGAGCTAGCTGCATGTTCTGTTCTGCATTTACATTGCAGGCAGCAGCTAGCAACA 
GTTTGCAGGAACAATCGATAATCCATTGTGTCAGGGAGGAACATGGAGAAAAACCGGGGCTGG 
AGACGAACGGGAGCAGCTGTACCGTACGTTTCTGAAGGGTGAACCCATCTGCGAAATCCGCAG 
ATTGGTTTGTTCAATTCCAACTTGCAGTCCTTCAGATTGGTTGCATGTTCAACCGTAGTACAT 

15 CTGAAAAATGAAGTGTTAAATACCTTGAGAAGACCTTCATGGAAGCATGCCTGCAGGCGATTA 
GCTAAGAAAAAAAAAATAAATGTACTTTTCGAAACTTAATTTTGGAGTTAGATTTTAGGGTGT 
TTCCATCGTAGTGTATTTTCTACTATTGCAGTTTAAACCGCTAATAGTCAGATATAAAATTTT 
ATCTATAGATCATTTATAAATCATTTTTAGTTGCTTCGTTCATTTTTCTACCACTTATCAACC 
ATAGCTCAACTGATCAATTGACAATAAAAGTTACTAAACGACATCGCTCATCACACACCCAAC 

20 GCTCACCGATGGGTGCCTCTCGACCACGAGTTTAGCACTTGTGCAACATATATGCGTGCGATG 
AACAT C T ACTG ATG GGG G ATGGG AATT-TTAGGGT-T GGTTG ATG AGG GTTG CAAGGG CACAGAG 
GCTGAG CAG CAGCATGCATGCATGGCTCTTGTGAAAACAAAAAAGGTTACTGGTAAATGACAT 
GCTGCTGTAGCTAGTTAGCAGAATGCAAGGCCCATGCATATGCAATGCTATGCAACAAGTATA 
GTACCAGCATGTATGGTAGCCAGCTAACTAATCTATCAGCAGAGGCAGCAAGCTCGTGCATGG 

25 TGTGATGCACTTCTCTCCAGTAATCTAGTGGTAATTTTCACCCAAAGCGTTGCTCATATGGAC 
AGTAATTAGTAATATTACCAAGGTTCACAATCCCGTTACCTGACCAAATACTACTCACGAATG 
GTATCTCTGGTTTTCGTTAAAACCGTTGGTAAACCAGCAAAAATAGACAAAATTTGTCAAAAT 
TTTAAATTTTAGTTTTTTTTTTAACTTAGCCGGGAAACCTTGAAGTTTGTGCTGTCGAGCTGT 
CCTGGGAAGGACGGTTTTGGTTGGGATTGTGAACCCTGGTTACTGCACTTCATTTTTGAACAG 

3 0 AT ATTAGTGCAAC AGACAAATGCCAACG C ATTTTTTT CTGTTTACCGG CAAGCTG AAGCTTTT 
ACGATCCCCATACCGCCGTTGCTGCAAACCTGCCAAGAAAGAGCAGCAGAAACAGGTGTCATT 
TTGTGGTGGAAAGCCAAGTAAAGTAAACAGAAGATGGAAGATAGTGAGGACCAGGGAGTGAGG 
CAGGGGACACATGGCCCACGCCTCCCTGCACATTTTCGTGTATAAATACAGGTGGATGCATCG 
CTCTCCGAGCATCCATCGGTTCTCTGCTCTGTTCATCCATAGAGTTTCCTCCTCTTCTCCTTC 

35 AGTGCAAG 



SEQ ID NO. 2 



GTAGAGAAGAGGATGTGTGTGTGTGTGTGTGAACTGTGAAGTGCAGAGTGCTTCTGTAGTTCT 
40 GTGTTATGTCCATAGTGATCTTGTTAGGATTGTTGCTATGGATGCATGATGTTATGGTTAATC 
TCTGAATTACAGTAGGGACTTCTCTGAGATCTCTGGATTAGTGGGGGGTGCTAAATTTTTTTC 
TGGTTGCATCAGCTTGGGTTTCTGGGATTGGTGTGGGTTCTTGCTCTGAATTTTGGTTCAGAA 
TGTCGATTTGTTTTGTGTTTGCCCTCTGAAGTTGAGAGTAGCTATGATCCATCCAGCACAGAA 
CTGCAGGTCCCTGCCTGCCGGCAGCATATACAGGACATGCCATTTTGCAAGCTCTGGGCTTAT 
45 GGTTTCTCTTTTGGAGTTCTTCTTCTTGCATGATCTGTGTTCTCTAACAAAGAAGCAAGATTT 
AGCAACTTTATTCAGAGACAAGAAAAGGATCTGGCAACCTTTTGTTTCTGTTTTATCCTACTC 
GTAAAGATTGTTATTTAAGCAAAAATTTCCCAAAAGTTTTAAATATAATTTCCATGATGTGCC 
ACTCTCATGTCCTTGAACCTGGCACTCATTATGGGCTCCTCAGAAGTGCTGTAGCTAATGTCA 
CTAATCTTTTGTATCTTTGTTCGTAGTCTTGTATTTTATGATGCTTATCCCTTTGTGCTTTCC 
50 ATGTTTGATGTCCAAATGTCATGGCAATGTTTTTGACTTCTAGTAGGGGTTTTAGTACCTTTT 
T GTT AG ATAAGT ACAT C CAAATT CTGTTT ATTT ATT CAAAAAT CATT CT GT TTATT CACTGAA 
AACATTTGTCCATTCAATGGAATCGTAAACTGTCTGTGTTTTTCAG 
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SEQ ID NO. 3 



ACT TT AG AT AATAAAGT AAGT CAC AAG AAAAAT AAAT AAT AAT TCC AAATTTTTTT AAT AAGA 
CGAGTGGTCAAACAGTACAAGTAAAAACTCAAAATTCCTTATATTATGGGACTTATATTATGG 

5 GACGGAGGAAGTAGAAGATTGTAGCCAAGAAAAAAACAAAAACAAACACACCGCCACCTGGCA 
GGCATGCATCTTAGGTCGGCACATTGAGAGGTCGGCAGTAGACGAGTTACCCTACACAACTGC 
TTCTTCAGTGAGCTAGCTGCATGTTCTGTTCTGCATTTACATTGCAGGCAGCAGCTAGCAACA 
GTTTGCAGGAACAATCGATAATCCATTGTGTCAGGGAGGAACATGGAGAAAAACCGGGGCTGG 
AGACGAACGGGAGCAGCTGTACCGTACGTTTCTGAAGGCTGAACCCATCTGCGAAATCCGCAG 

10 ATTGGTTTGTTCAATTCCAACTTGCAGTCCTTCAGATTGGTTGCATGTTCAACCGTAGTACAT 
CTGAAAAATG AAGTGTT AAATACCTTGAGAAG ACCTTCATGG AAGC ATGCCTG CAGG CGATTA 
GCTAAGAAAAAAAAAATAAATGTACTTTTCGAAACTTAATTTTGGAGTTAGATTTTAGGGTGT 
TTCCATCGTAGTGTATTTTCTACTATTGCAGTTTAAACCGCTAATAGTCAGATATAAAATTTT 
ATCTATAGATCATTTATAAATCATTTTTAGTTGCTTCGTTCATTTTTCTACCACTTATCAACC 

15 ATAGCTCAACTGATCAATTGACAATAAAAGTTACTAAACGACATCGCTCATCACACACCCAAC 
GCTCACCGATGGGTGCCTCTCGACCACGACyTTT 

AACATCTACTGATGCGCCATGCGAATTTTAGCGTTCGTTCATGACGCTTCCAACGGCACAGAG 
G CTGAG C AG C AG C ATGCATGCATGGCTCTTGTGAAAAC AAAAAAGGTT ACTGGT AAATGACAT 
GCTGCTGTAGCTAGTTAGCAGAATGCAAGGCCCATGCATATGCAATGCTATGCAACAAGTATA 

20 GTACC AG C ATGTATGGTAGCCAG CTAACTAATCTAT CAGCAGAGGCAGCAAGCTCGTGCATGG 
TGTGATGCACTTCTCTCCAGTAATCTAGTGGTAATTTTCACCCAAAGCGTTGCTCATATGGAC 
AGTAATTAGT AATATT ACCAAGGTTCACAATCCCGTTACCTGACCAAATACTACTCACGAATG 
GTATCTCTGGTTTTCGTTAAAACCGTTGGTAAACCAGCAAAAATAGACAAAATTTGTCAAAAT 
TTTAAATTTTAGTTTTTTTTTTAACTTAGCCGGGAAACCTTGAAGTTTGTGCTGTCGAGCTGT 

25 CCTGGGAAGGACGGTTTTGGTTGGGATTGTGAACCCTGGTTACTGCACTTCATTTTTGAACAG 
ATATTAGTGCAACAGACAAATGCCAACGCATTTTTTTCTGTTTACCGGCAAGCTGAAGCTTTT 
ACGATCCCCATACCGCCGTTGCTGCAAACCTGCCAAGAAAGAGCAGCAGAAACAGGTGTCATT 
TTGTGGTGGAAAGCCAAGTAAAGTAAACAGAAGATGGAAGATAGTGAGGACCAGGGAGTGAGG 
CAGGGGACACATGGCCCACGCCTCCCTGCACATTTTCGTGTATAAATACAGGTGGATGCATCG 

30 CTC TCC CAG CATC CATCGGTTCTCTGCTCTGTT CATC CAT AGAGTTTCCTCCTCTTCTCCTTC 
AGTG C AAGG CTTG AGGATCCAACTAGAAGAT AGCAATGG 

SEQ ID NO. 4 

35 CAGGTGGATGCATCGCTCTCCCAGCATCCATCGGTTCTCTGCTCTGTTCATCCATAGAGTTTC 
CTCCTCTTCTCCTTCAGTGCAAG 
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SEQ ID NO. 5 - SECOND PART OF EXON SEQUENCE 



GC TTG AGG AT C C AAC TAG AAG AT AG C AATGG 
5 SEQ ID NO 6 

ACTTTAGATAATAAAGTAAGTCACAAGAAAAATAAATAATAATTCCAAATTTTTTTAATAAGA 
CGAGTGGTCAAACAGTACAAGTAAAAACTCAAAATTCCTTATATTATGGGACTTATATTATGG 
GACGGAGGAAGTAGAAGATTGT AG CCAAGAAAAAAAGAAAAACAAACAC ACCG CCACCTGGCA 

10 GGCATGCATCTTAGGTCGGCACATTGAGAGGTCGGCAGTAGACGAGTTACCCTACACAACTGC 
TTCTTCAGTGAGCTAGCTGCATGTTCTGTTCTGCATTTACATTGCAGGCAGCAGCTAGCAACA 
GTTTGCAGGAAGAATCGATAATCCATTGTGTCAGGGAGGAACATGGAGAAAAACCGGGGCTGG 
AGACGAACGGGAGCAGCTGTACCGTACGTTTCTGAAGGCTGAACCCATCTGCGAAATCCGCAG 
ATTGGTTTGTTCAATTCCAACTTGCAGTCCTTCAGATTGGTTGCATGTTCAACCGTAGTACAT 

15 CTGAAAAATGAAGTGTTAAATACCTTGAGAAGACCTTCATGGAAGCATGCCTGCAGGCGATTA 
G CTAAG AAAAAAAAAATAAATGT ACTTTTCG AAACTTAATTTTGGAGTTAG ATTTT AGGG TGT 
TTCCATCGTAGTGTATTTTCTACTATTGCAGTTTAAACCGCTAATAGTCAGATATAAAATTTT 
ATCTATAGATCATTTATAAATCATTTTTAGTTGCTTCGTTCATTTTTCTACCACTTATCAACC 
ATAGCTCAACTGATCAATTGACAATAAAAGTTACTAAACGACATCGCTCATCACACACCCAAC 

20 GCTCACCGATGGGTGCCTCTCGACCACGAGTTTAGCACTTGTGCAACATATATGCGTGCGATG 
AACATCTACTGATGCGCCATGCGAATTTTAGCGTTCGTTCATGACGCTTCCAACGGCACAGAG 
G CTGAG CAG CAG CATG CATG C ATGG CTCTTGTGAAAACAAAAAAG G TT ACTGGT AAATG ACAT 
GCTGCTGTAGCTAGTTAGCAGAATGC^GGCCCATGCATATGCAATGCTATGCAACAAGTATA 
GTACCAG C ATGT ATGGTAGC CAG CTAACTAATCT AT CAGC AG AGGCAGC AAGCT CGTG CATGG 

25 TGTGATGCACTTCTCTCCAGTAATCTAGTGGTAATTTTCACCCAAAGCGTTGCTCATATGGAC 
AGTAATTAGTAATATTACCAAGGTTCACAATCCCGTTACCTGACCAAATACTACTCACGAATG 
GTATCTCTGGTTTTCGTTAAAACCGTTGGTAAACCAGCAAAAATAGACAAAATTTGTCAAAAT 
TTTAAATTTTAGTTTTTTTTTTAACTTAGCCGGGAAACCTTGAAGTTTGTGCTGTCGAGCTGT 
C CTGGG AAG G ACGGTTTTGGTTGGG ATTGTGAACCCTGGTTACTG C ACTT C ATTTTTG AACAG 

30 ATATTAGTGCAACAGACAAATGCCAACGCATTTTTTTCTGTTTACCGGCAAGCTGAAGCTTTT 
ACGATCCCCATACCGCCGTTGCTGCAAACCTGCCAAGAAAGAGCAGCAGAAACAGGTGTCATT 
TTGTGGTGGAAAGCCAAGTAAAGTAAACAGAAGATGGAAGATAGTGAGGACCAGGGAGTGAGG 
CAGGGGACACATGGCCCACGCCTCCCTGCACATTTTCGTGTATAAATA 
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